INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL ISSN 1841-9836, 10(5):746-759, October, 2015. Multi-attribute Collaborative Filtering Recommendation C. Yu, Y. Luo, K. Liu Changrui Yu School of Information Management and Engineering, Shanghai University of Finance and Economics, Shanghai 200433, China yucr@sjtu.edu.cn Yan Luo* SHU-UTS SILC Business School, Shanghai University, 20 Chengzhong RD, Shanghai 201800, China *Corresponding author: luoyan@shu.edu.cn Kecheng Liu Informatics Research Centre, Henley Business School, University of Reading, Reading, RG6 3XA, U.K. k.liu@reading.ac.uk Abstract: Currently researchers in the field of personalized recommendations bear little consideration on users’ interest differences in resource attributes although re- source attribute is usually one of the most important factors in determining user pref- erences. To solve this problem, the paper builds an evaluation model of user interest based on resource multi-attributes, proposes a modified Pearson-Compatibility multi- attribute group decision-making algorithm, and introduces an algorithm to solve the recommendation problem of k-neighbor similar users. This study addresses the issues on preference differences of similar users, incomplete values, and advanced converge of the algorithm, and realizes multi-attribute collaborative filtering. The algorithm is proved to be effective by an experiment of collaborative recommendation among multi-users in a virtual environment. The experimental results show that the algo- rithm has a high accuracy on predicting target users’ attribute preferences and has a strong anti-interference ability on deviation and incomplete values. Keywords: personalized recommendation, group decision-making, multi-attribute, collaborative filtering, Pearson-Compatibility. 1 Introduction A recommender system aims to generate meaningful recommendations to users for items or products that might interest them [1]. In many markets, consumers face a wealth of products and information from which they can make choices. To alleviate this problem, many web sites attempt to help users by incorporating a recommender system that provides users with a list of items and/or web pages that are likely to interest them. There are real-world operations of industrial recommender systems, such as the recommendations for books on Amazon, or movies on Netflix. As one of the most successful approaches to building recommender systems, collaborative filtering (CF) uses the known preferences of a group of users to make recommendations or predictions of the unknown preferences for other users [2]. The developers of one of the first recommender systems, Tapestry [3] coined the phrase "collaborative filtering (CF)", which has been widely adopted regardless of the facts that recommenders may not explicitly collaborate Copyright © 2006-2015 by CCC Publications Multi-attribute Collaborative Filtering Recommendation 747 with recipients and recommendations may suggest particularly interesting items, in addition to indicating those that should be filtered out [4]. The fundamental assumption of CF is that if users X and Y rate n items similarly, or have similar behaviors (e.g., buying, watching, and listening), and hence will rate or act on other items similarly [5]. Studies in literature [3] and [4] have shown that users’ interest in a product or service is affected by user topic preferences, content preferences, user habits, public evaluation and other factors, and that these factors are decided by the different attributes of items. For example, users liking a new movie may be caused by one or more attributes of the movie, such as the director, star, theme, content, style, public comment and so forth. Thus, in the application of collaborative filtering algorithm, it is necessary to use a multi-attribute analysis model, in which the user rating to an item is based on a different perspective (attributes) to describe their interest preferences. Although user and resource clustering based on the attribute information has been widely discussed in the multi-attribute collaborative filtering literatures, the recommend method is still traditional [6]. Such methods can only obtain a set of potential interest items of target users, but the reasons of such a recommendation are not given to the target user. In addition, the present studies scarcely consider the characteristics differences of similar users interested in the item attributes, which can lead to recommendation deviation [6]. For example, in a traditional way, user B is the most similar one to the target user A because A and B have the same degree of interest in the same film. However, if the film properties they prefer are completely different, it will lead to recommendation deviation when we give greater weight to B. Based on our previous research, we propose that multi-attribute collaborative filtering can be treated as a group decision making process. By building the rating matrix of target items for the similar users, we remove the user who has a large attribute preference difference to target user from the nearest user set, and save the problem of recommendations deviation. In addition, we can analyze the user’s interest performance from the view of item attributes and give the descriptions for the recommendation. Accordingly, this paper proposes a modified Pearson- Compatibility multi-attribute group decision-making algorithm and introduces the algorithm to solve the recommendation problem of k-neighbor similar users. This paper has extended our previous research significantly with much more details of the theoretical model, such as the characters of the multi-attribute evaluation, the selection of the nearest neighbor of target user, and the group decision-making model of personalized recommen- dation. More importantly, we have enriched and refined the collaborative filtering recommenda- tion algorithm which is the core of this paper. This paper is organized as follows. We review recommender systems and multi-attribute utility theory. We then introduce the establishment of user interest model using applied ontology method to describe user profile and illustrate the algorithm in specific steps. Finally, we do an experiment and conclude with findings and discussions. 2 Descriptions of Basic Model 2.1 User rating matrix A user’s comment on a certain item is usually an integration of multi-attribute comments made from different angles [6]. Suppose an item is shown as follows: P = {a1,a2,a3, . . .an} Based on the revised rating model, the paper establishes the user rating matrix. Suppose the user set is denoted as U = {U1,U2, . . .Up} and the user Uj rating for item Pi is denoted as 748 C. Yu, Y. Luo, K. Liu A(Uj,Pi): A(Uj,Pi) = a1 a2 a3 an−1 an    ω11 ω12 ω13 . . . ω1n−1 ω1n a1 ω21 ω22 ω23 . . . ω2n−1 ω2n a2 ω31 ω32 ω33 . . . ω3n−1 ω3n a3 . . . . . . . . . . . . . . . . . . . . . ω(n−1)1 ω(n−1)2 ω(n−1)3 . . . ω(n−1)(n−1) ω(n−1)n an−1 ωn1 ωn2 ωn3 . . . ωn(n−1) ωnn an Where ωxy is the importance of attribute ax of product Pi in comparison with attribute ay for user Uj. Here we use the 1-9 scale Paired comparison method to analyze the compared importance level of each attribute of the product that a user evaluates [7]. The rating matrix of an item is mainly acquired through user scoring, or acquired through user behavior analysis, or acquired with the approaches of Web semantic digging and fuzzy mathematics [6]. The present user rating system, such as Movielens, only asks a user to make a synthetic rating for the movie he or she watched and give a quantitative scoring between 1 and 5. This approach is not accurate in identifying the similarity in the preference of two users. For example, if two users are interested in the same movie. When they rate it, they give it the same score. However, the angles of their preference for the movie are totally different. A user may like the star and the style. The other user prefers the theme and the content of the movie. Therefore, we propose to build a multi-attribute rating system to evaluate a product, i.e., a product has many attributes. When evaluating the product, a user mainly gives the preference ratings of each product attribute. Another notable character of the multi-attribute evaluation is the absolute sparsity of the judgment matrix. This is caused by users who only know a few attributes of the product. This matrix is like the incomplete value judgment matrix of group-decision making, which easily lead to obvious deviations for the prediction results. So we should take some adjusted measures to solve the problem. 2.2 User Interest Model Suppose user Uj has rated several items and the rating matrix set is AS = {A(Uj,P1),A(Uj,P2), . . .A(Uj,Pt)}, where A(Uj,Pi)(i = 1,2, . . . t) is user Uj rating matrix for product i (i.e., Pi). This paper applies the rating matrix set to establishing the user interest model [6]. The specific steps are as follows. 1. Calculating the feature weight vector of each rating matrix, and then acquire the feature weight vector set V S ={V P1Uj (w1,w2,w3, . . .wsize(A(Uj,P1))),V P2 Uj (w1,w2,w3, . . .wsize(A(Uj,P2))) . . . V PtUj (w1,w2,w3, . . .wsize(A(Uj,Pt)))}. Where V PiUj (w1,w2,w3, . . .wsize(A(Uj,Pi))) denotes the feature weight vector of the user rating matrix A(Uj,Pi)(i = 1,2, . . . t) and size(A(Uj,Pi)) denotes the length of the feature weight vector. 2. According to the category of each attribute, calculate the user interest weights of the relavant attribute in the related resource category. Referring to the methods proposed Multi-attribute Collaborative Filtering Recommendation 749 by [8] and [9], we propose the following formula for calculating the degree of the user interest. V a(Uj,ay,n) = n∑ k=1 A(Uj,Pk)×V Pk Uj (wy) n (1) Where V a(Uj,ay,n) denotes the degree to which user Uj is interested in attribute ay. n is the number of the items which has attribute ay and user Uj have rated. A(Uj,Pk)×V Pk Uj (wy) (k = 1,2,3, . . .n) denotes the degree of user Uj interest in attribute ay of product Pk, which indicates how user Uj preference on item Pk is mostly determined by attribute ay. 2.3 Selection of the Nearest Neighbor of Target User We define the target user as the online user which requires evaluations and preliminary recommendations. The set of nearest neighbors is composed of the users who have the most similar interest and preference to item with the target user. Traditionally, researchers use k-Nearest Neighbor (KNN) algorithm and Pearson correlation- based similarity formula to do cluster analysis on the target user and the similar users, according to their similar interest and preference. Through these methods, the similar user set with differ- ent group standards can be obtained. In the process of collaborative filtering recommendation with group decision-making method, we consider the characteristics of the target user preference and search the similar user sets. The specific steps are as follows. Step 1. Search the user set which has similar interest distribution with target user UT , i.e., to obtain the intersection set of the interest distribution of all users and the target user. Two users can rate the same attributes in a number of categories and have similar interest weights. This approach can be applied to obtain the initial Nearest Neighbor set: IU = {(U1,Ω1),(U2,Ω2),(U3,Ω3) . . . ,(Uw,Ωw)} Where w is the number of users totally in the initial Nearest Neighbor set. uk denotes the kth user whose interest set is Sk. Ωk denotes the interest intersection between user Uk and the target user UT . Step 2. Use Pearson correlation-based similarity formula to calculate the similarity be- tween target user UT and a random user Uk. When the degree of similarity reaches the threshold, uk can be divided into SUT which is the similar user set of UT . Finally, SuT = {u1,u2, . . .us} is obtained, i.e., s similar users meet the threshold. The set of the interest intersection between the target user uT and s similar users that meet the threshold is SeuT = Ω1∪Ω2∪Ω3 . . . . . .∪Ωs. 3 Collaborative Filtering Recommendation Algorithm Based on Multi-attribute Group Decision-making 3.1 Group Decision-making Model of Personalized Recommendation After acquiring a similar user set SUT , we need predict and recommend the items that target user UT has not commented yet. Suppose the item set is Source = {P1,P2,S3, . . .Pn} and Pk (k = 1,2, . . .n.) is the item that the target user has not commented yet. The traditional collaborative filtering recommendation algorithm is applied to calculate the overall evaluation value of the item A(UT ,Pk) given by similar users and then obtain the initial recommended source SInitial. Suppose a random item Pk ∈ SInitial has n attributes denoted as a set Spk = 750 C. Yu, Y. Luo, K. Liu {a1,a2, . . .an}, the set satisfies the condition Spk ⊆ SeuT . The comment matrix is denoted as A1,A2,A3, . . .Ap, which means similar users in SuT commend Pk. We suppose p users are similar with the target user and have made comments to the item. The algorithm proposed in this paper requires that p >= 3. When p < 3, refer to the article [10]. After obtaining the evaluation matrix A1,A2,A3, . . .Ap, it vital for this research to use the information of comments to get preference matrix of the target users. The collaborative filtering recommendation of multi-attribute similar users is a typical group decision problem, in which many similar users make their decisions independently without any discussions and then the computer synthesizes their opinions to make recommendations to the target user. The result requires that all similar users’ comprehensive evaluation matrixes and comprehensive character- istics weight vector {w1,w2,w3, . . .wn } be calculated. wi denotes the comprehensive preference of the similar users to attribute ai, which is compared with other attributes of the product that has not been recommended. According to comprehensive characteristics weight vector, we can know which attribute determines the users’ interest on a specific item. Applying the semantic analysis method and the semi-structured description language to the attributes, we can make better recommendation to users. A recommendation model can be transferred to group decision-making model in order to solve a problem. However, the traditional group decision-making algorithm still need be im- proved when applied since there exists differences between similar users and decision-making expert. The differences mainly exist in the following aspects: 1. It is hard to use the weight vectors to measure the influence of similar users in the rec- ommendation processes mainly because of the complexity of user preferences. According to the traditional method, two users have a high similarity in their interest preferences. However, it is hard to identify the deviation existing in the preferences of the two users on a specific item. 2. There are a large number of incomplete values in user comment information because some users may make no comments on the unfamiliar attributes to ensure the accuracy of eval- uations. In this situation, the application of some traditional group decision-making algo- rithms, such as weighted average method or weighted least squared logarithm method may result in a great deviation of the final result. To solve the two problems mentioned above, this paper makes improvements in group decision- making compatibility test algorithm and builds a collaborative correction algorithm based on Pearson-Compatibility model. The core of this algorithm is to simulate the expert group-decision making process, discover the common opinions of most experts via data analysis, and revise the opinions of each expert to realize the final consistent compatibility. Moreover, to solve the prob- lem of premature convergence, this paper adopts a mode of multi-user collaborative correction. In addition to the usual algorithms, when it calculates the value of each user’s impact, this paper uses the common opinion of all users rather than target user‘s opinion as a measurable index. The user whose opinion is more similar to the common opinion would be given a higher weight, which is different to traditional method. 3.2 The Related Definitions Firstly, we introduce the calculation of the value of user impact weight. This value is an important indicator to measure the degree of evaluation information consistency between a user and the others [6]. The user matrix with higher group evaluation consistency will get higher Multi-attribute Collaborative Filtering Recommendation 751 weight. Vise versa. This paper adopts the concept of user rating similarity [11, 12]. We turn all the similar n×n user rating matrixes into n2×1 one dimensional vector. The user Uk judgment matrix Ak could be denoted as V k = {ωk11,ω k 12,ω k 13 . . .ω k 1n,ω k 21,ω k 22,ω k 23 . . .ω k 2n . . .ω k n1 . . . . . .ω k nn}. Pearson similarity formula to calculate the rating matrix between the user Uk and the user Ul is show as follows: Si(Ak,Al) = n2∑ i=1 (V k(i)−V k)× (V l(i)−V l)√ n2∑ u=1 [V k(i)−V k]2 × √ n2∑ u=1 [V l(i)−V l]2 (2) V k is the average value of all elements of user Uk rating matrix. V k = ωk11 + ω k 12 + . . . + ω k nn n2 . The similarity between user k and other users could be calculated as follows: Sik = p∑ l=1,l ̸=k Si(Ak,Al)/(p−1). where p denotes the number of users. We propose a formula Dk = 1 −Sik as the approximate measure of variance, which indicates the deviation degree of evaluation matrix. The approximate influence weight of user Uk is shown as follows: θk = (1−max{Sil, l = 1,2, . . .p})2 D2k (3) After acquiring the similar user influence weight, we suppose the group integrated approximate evaluation matrix of p users is A∗, and the value of each element ω∗ij in matrix A ∗ is as following: ω∗ij = p∑ k=1 θk ×ωkij/ p∑ k=1 θk (4) Matrix A∗ is not a positive reciprocal matrix. Suppose X is a positive reciprocal matrix composed of xij. This paper uses the least square method to modify X and propose the following formula: F(X) = min n∑ i=1 n∑ j=1 (xij −ω∗ij) 2 s.t  xij ×xji = 1xij > 0 (i,j = 1,2, . . .n) (5) The definition of compatibility and comprehensive compatibility are as follows: Definition 1. Suppose X is the group user comprehensive evaluation matrix obtained by using the method of the least squares. Then the judgment matrix compatibility between user k and the other users is as follows: S(Ak,X) = n∑ i=1 n∑ j=1 ω (k) ij ×xij max((ω (k) ij ) 2,(xij)2) n2 −α (6) 752 C. Yu, Y. Luo, K. Liu Although paper [13] has defined expert judgment matrix compatibility in usual cases, it does not consider the incomplete value. Formula (6) is a modified approach to solve the problem. Firstly, the block that the user does not rate is processed and given the value 0. Then α is used to indicate the number of 0. The aim of this approach is to eliminate the influence of user judgment matrix on compatibility indicator. Definition 2. Suppose A1′, A2′ . . .Ap′ are the compatibility correction of matrixes of p users’ judgment matrixes. Then we get the comprehensive consistency indicator S̄, as follows: S̄ = p∑ k=1 S(A,Ak′) p (7) Readers can refer to the simulation result of article [7]. When S(A,B) >= 0.8, the two evaluation matrixes is considered nearly compatible. When S̄ ≥ 0.8, evaluation matrixes of all p similar users is considered compatible. 3.3 Collaborative Correction Algorithm Based On Pearson-Compatibility Pearson-Compatibility model could be used to simulate the process of experts doing group discussions and finally making group decisions. The model mainly consists of two indicator calculation formulae: Pearson similarity calculation and Compatibility test. Pearson similarity calculation formula is mainly used to calculate the information reservation degree of user rating matrix after the matrix has been revised. And Compatibility test is used to test degree of consistency of user rating matrix after matrixes has been revised. The mainly function of Pearson- Compatibility model is to build the associations between two variational indicators. Based on this association, Compatibility correction algorithm will choose the best revised matrix in each iteration process, which could make the user matrixes unanimous and most similar to the true value. Referring to [14], based on the related method [15, 16], this paper proposes a Pearson- Compatibility model as follows: Ω(t) = Si(t) 1−β ×S(t) 1+β (8) Where Ω(t) denotes the calculation result of Pearson-Compatibility model after t times iteration. With all the continuous revision of user matrixes, Si(t) decreases from 1 to 0, and S(t) increases from 0 to 1. So each time of modification generates a better revised scheme which minimizes the change of Si(t) while maximizes the change of S(t). This indicates that all the experts acquires a better compatibility with the least possible original information lose. This method is similar to the real decision processes [16] and can provide an effective way to approach the true value. In Pearson-Compatibility model, the best revised scheme in each iteration process is the one that maximizes Ω(t). β is used to control the marginal diminishing rate and the marginal increasing rate of Si(t) and S(t). The efficiency of this algorithm is low when the consistency of the evaluation matrixes is low. Moreover, it is possible that the algorithm converges in advance, i.e., Ω(t) reaches the max value at the beginning of calculation. At that time, if the value of β increases, the efficiency of the algorithm can be improved and simultaneously solve the problem of pre-mature convergence to some degree. Regarding the designation and realization of compatibility correction algorithm based on Pearson-Compatibility, readers could refer to [17, 18]. In order to solve the problem of pre- mature convergence, this paper introduces a multi-user collaborative correction mechanism which Multi-attribute Collaborative Filtering Recommendation 753 is based on Simulated Annealing Algorithm (SAA). The following experiments in this paper tes- tify that this method can solve the problem of pre-mature convergence in advance, and further improves the accuracy of the experiment result. This algorithm is described as follows: Step1: Suppose u is the iteration times of compatibility test. zs is the number of users whose evaluation matrix could not be revised further in each step, which is given the original value 0. Suppose original user matrixes of all p users are A1(0),A2(0) . . . . . .Ap(0). Then after u times compatibility correction iteration, p users evaluation matrixes are: A1(u),A2(u) . . . . . .Ap(u) Step2: Use formula (2) and (3) to calculate the influence weights of the evaluation matrixes of p users. Step3: Use formula (4) and (5) to calculate the user comprehensive rating matrix X(u) after the uth compatibility correction. Step4: Use formula (6) to calculate the Compatibility between the kth user Ak(u) and X(u). Then apply formula (7) to calculate the comprehensive consistency degree of S̄(u). Step5: When zs = p and S̄(u) < 0.8, go to Step7. When zs ̸= p and S̄(u) < 0.8, go to Step6. When zs ̸= p and S̄(u) ≥ 0.8, we believe p users comprehensive consistency could pass the test, go on with Step8. Step6: Referring to [17], we can revise X(u) and Ak(u). Suppose the deviation matrix of Ak(u) is Dk(u) = {ek(u)ij }, where e k(u) ij = x (u) ij −ω k(u) ij . Suppose e k(u) wr = max(|x (u) ij −ω k(u) ij |) is the maximum deviation item after u times iteration, where ωk(u)ij is nonzero term. Let u = u+1 and begin a new round of revision. The revised rating matrix is: Ak(u) =   x (u−1) ij −e k(u−1) ij ×0.95 µ, when i = w,j = r 1/[x (u−1) ij −e k(u−1) ij ×0.95 µ], when i = r,j = w x (u−1) ij −e k(u−1) ij , when i ̸= w,i ̸= r,j ̸= r,j ̸= w (9) where u is the step length adjusting factor. This paper gives a limitation between 0 and 10 to u. Thus, the range of 0.95µ is [0.5987, 1]. The larger u is, the faster the rate of convergence of rating matrix of user k will be. In order to prevent an oversized adjustment of a single user matrix and thus lose the information in the initial rating matrix, the maximum adjustment extent is set as 0.5987. If u is given an appropriate value in each iteration, the calculating speed and the accuracy of the algorithm result will be highly improved. However, the existing algorithm is not effec- tive enough. This paper proposes an approach of building a multi-user collaborative filtering model based on simulated annealing algorithm (SAA), i.e., applying SAA to revise the maxi- mum deviation item in each iteration and calculate the optimal correction factor µk for each user k(k = 1,2 . . .p). In the simulated annealing algorithm (SAA), each point s of the search space is analogous to a state of some physical system. The goal is to bring the system, from an arbitrary initial state, to a state with the minimum possible energy. According to the principle of Metropolis, the probability of solid particle at temperature T changed from the disordered state to the steady state is exp(− ∆E k×T ). ∆E is the internal energy change quantity when the solid temperature change to T and k means the Boltzm-ann constant. When we apply SAA method to solve the 754 C. Yu, Y. Luo, K. Liu combinational optimization problem, internal energy E means the value of target function and temperature T means the control factor. Then we can get the simulated annealing algorithm. In the SAA, we suppose the initial energy value of target function in the model is E and control factor is T . Then we can control the attenuation amplitude in each iteration and use the random function to simulate the random motion of particle. And we calculate the objective function difference as ∆E, and according to Metropolis principle to choose the result. When T decrease to the critical value or the model solution could not be better, we stop this algorithm and get the approximate optimal result. The SAA applied to the multi-user collaborative correction algorithm could be described as following: Step6.1: Suppose it is the uth iteration. Let formula (8) be the energy function. Suppose c = 0 is the number of iterations for calculating µk, and sc = 0 is the verdict factor to judge whether to stop the iteration. Step6.2: For user Ak(u),(k = 1,2, . . .p), apply formula (9) to confirm the maximal deviation item and to calculate the evaluation set A1(u)c ,A 2(u) c ,A 3(u) c , . . .A p(u) c . Randomly choose the initial state set [µ1c,µ 2 c, . . .µ p c] between 0 and 1. Step6.3: Run Matlab software to generate p random numbers ∆1,∆2, . . .∆p, which are between 0 and 1. Suppose µkc = ∆k (k = 1,2 . . .p, and ∆k in each iteration is different). Let c = c + 1 and calculate the energy function Ω(u)c . Suppose T is the temperature schedule. Its initial value T0 = 100. The attenuation function is Tc = Tc−1 × 0.95. Suppose the counting variable is rt and rt = rt + 1. Let ∆f = Ω(u)c −Ω (u) c−1. Then we make the following judgment: When ∆f > 0, µ1c,µ 2 c, . . .µ p c is accepted as the new state solution and go on with Step4. When ∆f < 0, there are two possibilities. If e( ∆f Tc ) > Random[0,1] holds, µ1c,µ 2 c, . . . µ p c is accepted as the new state solution and go on with Step4. Otherwise, let µkc = µ k c−1, sc = sc +1. Step6.4: When sc > 10 or Tc < 0.01, which means we reach the end condition and export the result µ1c,µ 2 c, . . .µ p c. Otherwise, go on with Step2 after the value of u is obtained. Step7: Popup dialog prompt whether to set a new threshold Sxy which means the user acceptable level. If the user choose to reset, zs value would be zero. And we will let the value of Pearson in each loop iteration compared with Sxy and get the value of zs. Then we execute Step5. If the user choose not reset, we execute Step8. Step8: Export the calculation result X(u), S (u) and p user final evaluation matrixes Ak(u),(k = 1,2, . . .p). We have testify the algorithm is effective in the previous studies. If you are interested in the test process, you can refer to article [16] and [17]. 4 Experimentation We build an experiment environment to execute our algorithm at current conditions to vali- date the effectiveness of this algorithm [6]. The environment is described as follows: We adopt ontology and the relevant methods in order to design and develop the movie information database. Jena 2.6.2 is applied to store the movie information in RDF format and ARQ-2.2 is used to manage the movie information. We have imported 300 movies which involve 10 categories. A semantic analysis of each movie is conducted to get key words and form the initial attribute set. Then the synonyms and the similar words in the initial set are combined. Take some topical words as the characteristic attributes and use them to represent Multi-attribute Collaborative Filtering Recommendation 755 these movies. Finally, 15 attributive categories and 282 concrete attributes are extracted. Then an online multi-attribute rating system based on the movie database and a collaborative filtering recommendation system based on group-decision making are designed and developed. The concrete process that tests the algorithm is as follows [6]: 1. Select four evaluated movies in which G(u,p) is comparatively big and use them as the testify set. They respectively include 6, 7, 8 and 9 attributes. Then, use the target user evaluation matrixes which are further used as the real weight vectors to calculate the user interest vectors for each movie. 2. Based on the user-evaluated movies set (excluding the 4 movies in the test set), apply the methods in sections 2.3 and 2.4 to searching the most similar user set for the target user (i.e., the similar interest distributions). Take Movie 1 with 6 attributes as an example. The real interest vectors are S = [3.7288, 2.7053, 1.9627, 0.4657, 0.3293, 0.3293]. The total score of this movie is 4.5 which indicates that the target user has a high preference to this movie. Moreover, the preference is mainly determined by the first three attributes. Totally, 9 similar users have evaluated this movie. Firstly, the traditional collaborative filtering algorithm is applied to obtaining the weighted average of the total score of this movie and gets the result 3.94. We are not sure whether the target users have interests in this movie. Thus, we need use the similar user evaluation matrixes to make judgments. The evaluation matrixes of six similar users are listed as follows: A =   1 2 2 5 7 9 1/2 1 1 6 5 6 1/2 1 1 7 7 6 1/5 1/6 1/7 1 1 2 1/7 1/5 1/7 1 1 3 1/9 1/6 1/6 1/2 1/3 1   B =   1 2 3 3 7 7 1/2 1 0 4 7 6 1/3 0 1 0 4 5 1/3 1/4 0 1 1 2 1/7 1/7 1/4 1 1 3 1/7 1/6 1/5 1/2 1/3 1   C =   1 2 2 4 0 0 1/2 1 1 3 0 0 1/2 1 1 2 0 0 1/4 1/3 1/2 1 0 0 0 0 0 0 1 0 0 0 0 0 0 1   D =   1 2 3 4 6 8 1/2 1 2 2 4 6 1/3 1/2 1 1/2 4 7 1/4 1/2 2 1 3 2 1/6 1/4 1/4 1/3 1 2 1/8 1/6 1/7 1/2 1/2 1   E =   1 2 0 7 8 0 1/2 1 2 8 7 7 0 1/2 1 0 7 8 1/7 1 0 1 1 2 1/8 1/7 1/7 1 1 1 0 1/7 1/8 1/2 1 1   F =   1 2 4 0 7 0 1/2 1 2 6 7 7 1/4 1/2 1 0 2 2 0 1/6 0 1 0 2 1/7 1/7 1/2 0 1 1 0 1/7 1/2 1/2 1 1   3. Apply the four algorithms to calculate the score of the four movies and make comparisons on the deviations of the real weight vectors of the target users. The result is listed as follows: Table 1: The comparison between algorithms Movie 1 (6 order) Movie 2 (7 order) Movie 3 (8 order) Movie 4 (9 order) Arithmetic weighted average method 0.1589 0.0564 0.1985 0.1132 Logarithmic least squares method 0.1054 0.0534 0.1398 0.0831 Compatibility correction algorithm 0.0877 0.0556 0.1042 0.0687 Our algorithm 0.0780 0.0543 0.0885 0.0683 756 C. Yu, Y. Luo, K. Liu When the scores of a part of similar users have a large deviation from those of the other users, the algorithm proposed in this paper can solve the problem of early convergence better than the other algorithms and obtain an accurate result, as shown in Table 1. The core of our algorithm is the revised values of the comprehensive evaluation matrix determined by the majority of users. Accordingly, the highly deviated evaluation values are revised. The result of seven order matrix experiment shows that the deviations of the result of any algorithms are not notable when all the similar users have unanimous evaluation matrixes, The result of nine order matrix experiment shows that the result of the proposed algorithm is similar to that of compatibility correction algorithm when all the similar users have unanimous evaluation matrixes, while still have some incomplete values, and is better than the other two algorithms obviously. When there are 5 similar users and six order evaluation matrix is executed with our algorithm, the change tendencies of the main indicators are shown in Figure 1: 0 2 4 6 8 10 12 14 16 0.65 0.7 0.75 0.8 0.85 0.9 0.95 1 Iterations P e a rs o n -C o m p a ti b il it y V a lu e Compatibility Value of five similar users Pearson Value of Five similar users Pearson-Compatibility Value of five similar users Figure 1: The main indicators change in our example 4. For example, for the first movie with the six order evaluation matrix, the influence of the user number on the accuracy of recommendation results is examined. Suppose the user number is 3, 5, 7 and 9. The accuracy and the number of iterations are calculated with different means of permutation and combination. Part of the result is shown in Table 2: Table 2: The comparison of different similar user numbers User number 3 5 7 9 initial Indicator of Comprehensive consistency degree 0.5872 0.6890 0.6872 0.7081 Deviation of result 0.1680 0.1093 0.0828 0.0780 Number of iterations 12 16 28 37 More deviation items are generated as the user number increases. Therefore, the iterations of this algorithm rise. This test indicates that the effectiveness of this algorithm is highly related to the initial consistency degrees of all users and the number of users. In general, when the initial consistency degree is low and the similar user set is limited (e.g. there are 3 users), it is hard for the algorithm to dig out the common information among the users. Therefore, the result deviation is huge. However, when the number of similar users increases to a certain degree (e.g. the number is equal or bigger than 7), the algorithm still remains a good accuracy, even if the initial compatibility is low. Regarding the provision of personalized services to the target users, this paper calculates the comprehensive evaluation weight vectors of each movie with the group-decision making model. Multi-attribute Collaborative Filtering Recommendation 757 Take the first movie with 6 attributes as an example. The comprehensive evaluation score of nine similar users is G(u,p) = 3.94. The comprehensive evaluation vectors are V = [4.0653 2.9492 1.7630 0.3972 0.3044 0.3607]. Each value of the weight vector represents the potential interest degree of the target user on the corresponding product attributes. Thus, the total score calculation formula is TScore = G(u,p)× n∑ i=1 Vi/n (10) where TScore denotes the total score, n denotes the number of attributes, and Vi denotes the comprehensive evaluation value of the (i)th attribute of the product. The recommendation set can be fixed through the way of ranking or threshold setting. The total scores of the four movies is shown in Figure 2: 0 1 2 3 4 5 6 7 T S c o re 6.4616 5.9400 5.6734 4.6622 Movie1 Movie1 Movie3 Movie4 Figure 2: The TScore of four movies The first movie has the highest score with six attributes, as shown in Figure 2. The characters of this movie are analyzed as follows using the user interest model. Firstly, attribute Vi of weight vector V is normalized and generate vector V = [0.4131,0.2997,0.1792,0.0404,0.0309,0.0367]. The three attributes whose values are bigger than the average value 0.1666 are picked out. When the attribute value is bigger than 0.1666, the majority of users have evident preference on movie one. In the target user interest model, there are 124 attributes totally. The 3 attributes of movie one that are bigger than the average value are connotation, characteristic and special efficiency are still larger than the average value (1/124 = 0.0081) among 124 target user attribute preferences. This result indicates that target user has evident preference to these 3 attributes and the popularity of this movie is mainly determined by these attributes. Therefore, we could introduce movie one to target user and provide the reasons why this movie is introduced. We also could use semantic analysis technique to describe each attribute in detail and provide more personal service to target user. The comparison analysis histogram is shown in Figure 3. M bar means the percentages which attributes are ranking at top 3 in movie one, U bar means percentages which attributes are ranking at top 3 in target user interest model. The analysis show us that target user may have larger interest preference to attribute one in movie one. 5 Conclusion The traditional collaborative filtering personal recommended algorithms seldom consider the multi-attribute problem. Our approach is based on group-decision making. We propose an 758 C. Yu, Y. Luo, K. Liu 0 0.1 0.2 0.3 0.4 0.5 0 0.01 0.02 0.03 0.04 0.05 U barM bar Connotation Special EfficacyPlot 0.4131 0.2997 0.1792 0.0450 0.0254 0.0326 Figure 3: Comparison analysis histogram of movie one with target user interest. improved Pearson-Compatibility algorithm which is applied to the collaborative filtering recom- mend field. We then build a virtual recommend environment and testify the effectiveness and feasibility of this algorithm. The advantages of the collaborative filtering personal recommended algorithm based on group-decision include: Identifying a more suited similar users set for the target user. An accurate target user model could be set up via field subdivision according to field attributes. Then the users who have similar interest distribution with target user can be found. The similar user set is generated. Providing more accurate and personal recommend service to the target user. The traditional collaborative filtering method could neither recommend a result set to target user, nor provide analysis service [6]. This weakness is overcome by making an information integration to know what are mainly factors determining the user preference, so that we could handle the user need more accurate. Considering evaluation deviation between the similar users and revising the user evaluation. Instead of weighted mean, group-decision making method is used to calculate the comprehensive evaluation score. Deleting the deviation item and revising the evaluation matrix could make the result have a better fitting effect [6]. The collaborative filtering method based on Pearson- Compatibility is applied to the personal recommended field. The result of the experiment shows that the algorithm is stable when dealing with the deviation items and identifies the common preference information between similar users. Acknowledgment This research work is supported by Innovation Program of Shanghai Municipal Education Commission (No. 12ZZ070, No. 12YS018) and Humanity and Social Science Youth Foundation of Ministry of Education, China (No. 12YJC630136). Bibliography [1] G. Adomavicius, A. Tuzhilin (2005); Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions, IEEE Trans. on Knowl. and Data Eng., 17(6):734-749. Multi-attribute Collaborative Filtering Recommendation 759 [2] Xiaoyuan Su, Taghi M. Khoshgoftaar(2009); A survey of collaborative filtering techniques, Advances in Artificial Intelligence, 2009:1-19. [3] D. Goldberg, D. Nichols, B. M. Oki, D. Terry(1992); Using collaborative filtering to weave an information tapestry, Communications of ACM, 35(12):61-70. [4] P. Resnick, H. R. Varian(1997); Recommender systems, Communications of the ACM, 40(3):56-58. [5] K. Goldberg, T. Roeder, D. Gupta, C. Perkins (2001); Eigentaste: a constant time collab- orative filtering algorithm, Information Retrieval, 4(2):133-151. [6] C. Yu, Y. Luo, K. Liu(2014); A multi-attribute collaborative filtering recommendation al- gorithm based on improved group decision-making, ICISO 2014, IFIP AICT, 426:320-330. [7] F. Herrera, E. Herrera-Viedma, F. Chiclana (2001); Multiperson decision-making based in multiplicative preference relation, European J. of Operational Research, 129:372-385. [8] A. Shepitsen, J. Gemmell, B. Mobasher, R. Burke (2008); Personalized recommendation in social tagging systems using hierarchical clustering, Proc. of the 2008 ACM conference on Recommender systems, Lausanne, Switzerland. [9] A. Ypma, T. Heskes (2002); Categorization of Web pages and user clustering with mixtures of hidden Markov models, Proc. of the WEBKDD 2002 Workshop: Web Mining for Usage Patterns and User Profiles, SIGKDD 2002, Edmonton, Alberta, Canada. [10] Z.S. Hua, B.G. Gong, X.Y. Xu (2008); A DS–AHP approach for multi-attribute decision making problem with incomplete information, Expert Systems with Applications, 34(3):2221- 2227. [11] H. Jeon, T. Kim, J. Choi (2010); Personalized information retrieval by usingadaptive user profiling and collaborative filtering, AISS: Advances in Information Sciences and Service Sciences, 2(4):134-142. [12] Liu, A., Yang, Z.(2010); Watching, thinking, reacting: a human-centered framework for- movie content analysis, International Journal of Digital Content Technology and its Appli- cations, 4(5):23-37. [13] Y. Dong , Y. Chen, S. Wang (2004); Algorithm of solving weights for group decision making by improving compatibility, Systems Engineering-theory & Practice, 2004-10. [14] C.-S. Yu (2002); A GP-AHP method for solving group decision-making fuzzy AHP problems, Computers & Operations Research, 29(14): 1969-2001. [15] Y. Zhang, H. Wang (2002); Development and application of P-S aided decision system, Systems Engineering-Theory Methodology Application, 2002-04. [16] L. Liang, L. Xiong, G. Wang (2004); A new method of determining the reliability of decision- makers in group decision, Systems Engineering, 22(6):91-94. [17] J. Barzilai, F. A. Lootsma (1994); Power relations and group aggregation in the multiplica- tive AHP and SMART, Proc. 3rd Int. Symp. AHP, 157-168. [18] J. Sun, W. Xu, Q.D. Wu(2005); A new algorithm for incomplete matrixes’ compatibility improvement and group ranking in group decision making, Systems Engineering-Theory & Practice, 10(10):89-94.