International Journal of Interactive Mobile Technologies (iJIM) – eISSN: 1865-7923 – vol. 15, No. 01, 2021 Paper—Two Models Based on Social Relations and SVD++ Method for Recommendation System Two Models Based on Social Relations and SVD++ Method for Recommendation System https://doi.org/10.3991/ijim.v15i01.17751 Ali M. Ahmed Al-Sabaawi () Aksaray University, Aksaray, Turkey alisabaawi5@gmail.com Hacer Karacan Gazi University, Ankara, Turkey Yusuf E. Yenice Aksaray University, Aksaray, Turkey Abstract—Recently, Recommender Systems (RSs) have attracted many re- searchers whose goal is to improve the performance of the prediction accuracy of recommendation systems by alleviating RSs drawbacks. The most common limitations are sparsity and the cold-start user problems. This article proposes two models to mitigate the effects of these limitations. The proposed models exploit five sources of information: rating information, which involves two sources, namely explicit and implicit, which can be extracted via users’ ratings, and two types of social relations: explicit and implicit relations, the last source is confidence values that are included in the first model only. The whole sources are combined into the Singular Value Decomposition plus (SVD++) method. First, to extract implicit relations, each non-friend pair of users, the Multi-Steps Resource Allocation (MSRA) method is adopted to compute the probability of being friends. If the probability has accepted value which exceeds a threshold, an implicit relationship will be created. Second, the similarity of explicit and implicit social relationships for each pair of users is computed. Regarding the first model, a confidence value between each pair of users is computed by di- viding the number of common items by the total number of items which have also rated by the first user of this pair. The confidence values are combined with the similarity values to produce the weight factor. Furthermore, the weight fac- tor, explicit, and implicit feedback information are integrated into the SVD++ method to compute the missing prediction values. Additionally, three standard datasets are utilized in this study, namely Last. Fm, Ciao, and FilmTrust, to evaluate our models. The experimental results have revealed that the proposed models outperformed state-of-the-art approaches in terms of accuracy. Keywords—Recommendation system, SVD++, social relations, data sparsity, cold-start 70 http://www.i-jim.org https://doi.org/10.3991/ijim.v15i01.17751 Paper—Two Models Based on Social Relations and SVD++ Method for Recommendation System 1 Introduction Recommendation Systems (RSs) play a critical role in providing interested users’ items among the mass of information that is available in online applications. Techni- cally, a recommender system can be categorized in various fields, such as machine learning, text classification, and information retrieval [1][2]. The main aim of the RS is to find the relevant items by understanding users’ behavior and predicting items that might be of interest to the user [3]. Recently, Collaborative Filtering (CF) has become one of the most common prototypes and most widely successful technologies in RS [4]. CF involves two types: memory-based and model-based. User-based and item-based are two techniques of the memory-based method, and both compute the similarity among users/items [5]. On the other hand, the model-based approach utiliz- es machine learning techniques to analyze users’ activity and behavior in a particular online application, the system predicts items for that user according to his behavior [1]. However, CF faces a set of challenges, such as data sparsity, cold-start, and scalability. As data arise, which reaches thousands of users and items that remain without ratings, such a phenomenon is called a data sparsity problem. Moreover, the cold-start problem is the issue of the lack of ability to make predictions for new items or users. Furthermore, there is no history information on cold-starts of users/items; thus, RSs cannot handle this problem [5]. To curtail such shortcomings, many data mining methods have been adopted, including clustering CF and matrix factorization (MF) [6], MF can find the best latent factor in the user-item matrix. MF includes many methods, such as Singular Value Decomposition SVD [7], Probabilistic Matrix Factorization (PMF) [8] and [9], Non-negative Matrix Factorization (NMF) [10], and SVD++ [1]. Many studies have been made to overcome further the problems of RSs. These studies exploited different techniques and integrated various sources such as social relations, context, and personal information to enhance the accuracy of the RS. Social relations have widely been used recently in the RS field [11] and [12], in which the user can return to his/her friends’ relationships before deciding to accept or reject a particular item. Several social factors were applied, such as reliability-based Trust- Aware Collaborative Filtering (TACF) [13], explicit relationships Social Recommendation [14], explicit implicit relationships using Probabilistic Matrix Factorization (EISR) [9], and MSRA-SVD [5]. Explicit relations are the main source of social information. Here, a user can find his interesting items by exploiting the preferences items of his/her explicit friends. Despite the explicit relations contributing to alleviate the data sparsity problem, the problem still remains because most of the users have a limited number of relations. Accordingly, another factor can be adopted which has massive information called implicit relations [15]. In real life, users have similar preferences to each other even when there are no di- rect relations between them in the network. Many methods have utilized explicit rela- tions and others have exploited implicit relations to overcome the data sparsity prob- lem, but still require further sufficient prediction accuracy. It is poor practice to de- pend only on direct connections in the social network, as proposed by [14], and indi- rect relations with one step of the users’ neighbourhood only [9]. The limitation of [9] is neglecting information of other steps of the indirect connection which may impli- iJIM ‒ Vol. 15, No. 01, 2021 71 Paper—Two Models Based on Social Relations and SVD++ Method for Recommendation System cate vital information. In addition, social interactions among users are established based on trust relations which can be exploited as extra factors to reduce the sparsity problem [13]. Moreover, the limitations of [5] ignored the implicit feedback infor- mation, and the trust relations of the users were not exploited. In this study, the foremost goal increases the accuracy by overcoming the previous limitations, two models are proposed. Both methods adopted four sources of infor- mation: implicit relations which are identified by Multi-Steps Resource Allocation (MSRA), explicit social relations, explicit feedback, and implicit feedback, which can be extracted via rating information. These sources are integrated into the SVD++ method to create the MSRA-SVD++ model. Finally, the confidence factor between users, which is the fifth source of information, it is utilized in the first proposed model only, which is applied and combined with the similarity values to produce the weight- SVD++ model. 2 Material and Methods In this section, the methods and sources that are used in this research are explained. SVD++ method is demonstrated in the next subsection. Whereas the link prediction technique is explained in 2.2. Social relations are clarified in subsection 2.3. Finally, the two proposed models are highlighted in subsections 2.4 and 2.5. 2.1 SVD++ method The SVD++ method, which is utilized by many researchers, is derived from SVD. However, the SVD method exploits explicit information (votes or ratings) that are available in user-item matrices and ignores the implicit information such as clicking, browsing, and purchasing [1]. SVD++ proposed by [16]. For example, assume a 2D user-item matrix (M), wherein the rows p indicates the users, and the columns q repre- sents the items. The rating value where the users can rate the items can be expressed as ru,v. The purpose of the RS is to predict items for users that might interest them. In general, the dataset is huge and many users leave the items without a rating, thus the RS faces a data sparsity problem. However, SVD++ decomposes the user-item matrix into two matrices, p and q, which are orthogonal matrices to represent the users and items, respectively. The decomposition process converts the high dimensional user- item matrix that contains all the data into low-dimensional data pu,f and qi,f, where u is the user number, i the item number, and f indicates the dimension after reduction. In other words, SVD++ reduces the size of matrices p and q to a specific value and finds the best low-dimensional data representation of the user-item matrix by removing any unimportant data. In addition, SVD++ is an expanded version of SVD. The difference is that the implicit feedback is taken into consideration in SVD++, whereas the SVD method depends only on users’ ratings (explicit feedback). Subsequently, a binary matrix is constructed to represent the implicit feedback, where the new matrix is sepa- rated into left and right orthogonal matrices. The following formula shows the predic- tion of SVD++: 72 http://www.i-jim.org Paper—Two Models Based on Social Relations and SVD++ Method for Recommendation System ru,i = μ + bu + bi + q T (pu + 1 √|Nu| ∑ yjj∈Nu ) (1) bi = (∑ ru,i−μu∈R(i) ) |Ri| (2) bu = (∑ ru,i−μu∈R(u) −bi) |Ru| (3) where 𝜇 is the average value of the whole dataset, 𝑏𝑢 and 𝑏𝑖 are the observed de- viations from the mean value of user u and item i, respectively. |𝑁𝑢| is the number of items rated by user u, |𝑅𝑢| is the number of users that rated a specific item, |𝑅𝑖| is the number of items that were rated by a set of users, and 𝑦𝑗 is the left orthogonal matrix of the implicit matrix. [17] proposed two regularization parameters (𝜆1 and 𝜆2) which can be added to |𝑅𝑢| and |𝑅𝑖| to compute the user bias and item bias. Eqs (2) and (3), are adopted in this study and modified thus: bi = (∑ ru,i−μu∈R(i) ) λ1+|Ri| (4) bu = (∑ ru,i−μ−biu∈R(u) ) λ2+|Ru| (5) 2.2 Link prediction Social networks encompass a large amount of information, including explicit in- formation such as direct relations (trust relations) and indirect relations (friend’s rela- tions). Another kind of information is implicit relations. Regarding the implicit rela- tions, the link prediction technique is exploited to predict missing information in a social network. Many methods are utilized to find the implicit information by predict- ing new links in a social network, such as Resource Allocation Index (RAI) [15] and Multi-Steps Resource Allocation (MSRA) [18]. According to [5], MSRA achieved better results than RAI, CN ad JC such that it can predict implicit relations with an acceptance ratio of the accuracy (for more details, review reference [5]). MSRA ac- commodates a multi-step process, but in this work two-steps resource transmission is employed to achieve the implicit relations, thus: sxy MSRA2 = sxy MSRA1 + ∑ ∑ 1 kz1 1 kz2 z2 ∈Z2z1 ∈Z1 (6) 𝑠𝑥𝑦 𝑀𝑆𝑅𝐴1 = ∑ 1 |ᴦ(𝑧)|𝑧∈ |ᴦ(𝑥) ⋂ᴦ(𝑦)| (7) ᴦ(𝑥) and ᴦ(𝑦) indicate the sets of the friends of x and y, respectively. Z1 refers to ᴦ(𝑥) ∩ ᴦ(𝑦) and Z2 to ᴦ(𝑧1) ∩ ᴦ(𝑦)/𝑍1. iJIM ‒ Vol. 15, No. 01, 2021 73 Paper—Two Models Based on Social Relations and SVD++ Method for Recommendation System 2.3 Social relations Social relations provide solutions to overcome the data sparsity problem in RS. In reality, users prefer to ask their friends before deciding on a certain action. The same aspect can be employed in RS. Recently, many applications allow how users to estab- lish their friends’ relationships and the RS can exploit these relations to enhance the performance. Several studies have utilized social relations to boost prediction accura- cy. [14] utilized explicit social relations. For each pair of users, the similarity tech- nique was applied to calculate the similarity between them. Finally, the model was combined with MF. Similarly, in EISR was proposed by [9], explicit and implicit relations were exploited. The same scenario of the previous study was followed, but here implicit relations were added as an extra factor. In [5], the implicit and explicit relations were used, but the difference is using MSRA algorithm instead of RAI algo- rithm, then the similarity was computed and all the social factors with rating infor- mation were incorporated into SVD method. Subsequently, the last two studies were based on implicit relations to represent implicit friends’ relationships, and these rela- tions concluded by link prediction technique that finds missing relations in a social network. Another kind of social relations is confidence statements, which can be di- vided into explicit trust and implicit trust statements [19]. Unlike friend relations, a trust statement means directed relations, such as when user x trusts user y, it is not necessary for user y trusts user x. In some datasets, an explicit trust may be unavaila- ble; therefore, the implicit trust can be estimated from a user-item matrix, by monitor- ing the attitude of the users through their ratings that lead to conclude new resource information, which may increase the prediction accuracy. The following formula proposed by [20] shows the calculation of implicit trust. Tu,v = Au,v/Au (8) Where 𝐴𝑢 is the number of items that are rated by user u. 𝐴𝑢,𝑣 is the set of common items rated by u and v. According to [13], the trust statement can be integrated into the rated information to determine the sparsity problem. Thus, the adjusted weight between users u and v is computed thus: wu,v = { 2∗simu,v∗Tu,v simu,v+Tu,v , simu,v ∗ Tu,v ≠ 0, and simu,v + Tu,v ≠ 0 simu,v, simu,v ≠ 0,and Tu,v = 0 Tu,v , Tu,v ≠ 0,and simu,v = 0 0, otherwise (9) Where the 𝑠𝑖𝑚𝑢,𝑣 denotes the similarity between users u and v, which computes using the Pearson Correlation Coefficient (PCC) as the following. 𝑠𝑖𝑚(𝑢,𝑣) = ∑ (𝑟𝑢,𝑗ℎ−𝑟𝑢̅̅ ̅)(𝑟𝑣,𝑗ℎ−𝑟𝑣̅̅̅) 𝑛 ℎ=1 √∑ (𝑟𝑢,𝑗ℎ−𝑟𝑢̅̅ ̅) 2𝑛 ℎ=1 √∑ (𝑟𝑣,𝑗ℎ−𝑟𝑣̅̅̅) 2𝑛 ℎ=1 (10) 74 http://www.i-jim.org Paper—Two Models Based on Social Relations and SVD++ Method for Recommendation System Here, 𝑟𝑥,𝑦 is the rating value of item j that is rated by user u. 𝑟�̅� indicates the mean rating of the user x, and n refers to the items that are rated by both users u and v. The similarity value between u and v ranges over [-1,1], so a higher value means high similarity. Although the preceding methods improved the RS performance, it still lacks adequate accuracy. The following subsections propose how to overcome this limitation. 2.4 Weight-SVD++ model Five sources of information are used to feed the matrix factorization technique, namely the rating file that includes explicit and implicit feedback information, explicit relations, implicit relations, and confidence values. The rating information and explic- it social relations sources are obtainable directly from the rating file and social file respectively, while the implicit relations and confidence values sources are computed. Regarding the implicit relations, MSRA is adapted to discover the implicit relations of users within the social network. The user not only can be affected by his friends’ taste but also by other users, even if there is a distance between him and them in the net- work, and especially when those users are friends of his/her friends. Therefore, MSRA is utilized to apply a link prediction technique and predict missing relations that may contain vital information. In addition, confidence values can be exploited to reduce the sparsity problem and consequently increase the prediction accuracy. After acquiring the implicit relations via MSRA, a binary user-user matrix is created, one's values refer to explicit/implicit relations. Afterward, the similarity of the explicit relations and implicit relations is computed using the Pearson correlation coefficient (PCC) which is uses rating values of the users, as shown in Eq. (10). The output after applying the similarity is another user-user matrix with the computed similarity values. Furthermore, the confidence value is computed among all users using Eq. (8); a us- er-user matrix is created that has confidence values. Later, the weight values are com- puted by integrating the user-user matrix of the similarity values, and the user-user matrix of the confidence values, by Eq. (9). Eventually, the weight values and rating values are integrated into the SVD++ method to produce the weight-SVD++ model, details of which are shown in Figure 1. The optimization technique is adopted using Stochastic Gradient Descent (SGD) method, which is used to regularize the square error as follows: ∂E ∂𝑈𝑢 = ∑ (𝑟𝑢,𝑖 − 𝜇 − 𝑏𝑖 − 𝑏𝑢 − 𝑞𝑖 𝑡 (𝑝𝑢 + 1 √|𝑁𝑢| ∑ 𝑦𝑗𝑗∈𝑁𝑢 )) 2 (𝑢,𝑖)∈𝐾 + 𝜆(𝑏𝑢 2 + 𝑏𝑖 2 + ‖𝑝𝑢‖ 2 + ‖𝑞𝑖‖ 2 + ‖𝑦𝑗‖ 2 ) + 𝛽𝑤 ∑ ∑ 𝑊(𝑖,𝑓)‖𝑝𝑖 − 𝑝𝑓‖𝑓𝑟𝑜 2 𝑓𝜖 𝐹(𝑖) 𝑢 𝑖=1 (11) ∂E ∂𝑉𝑖 =∑ (𝑢,𝑖)∈𝑚 (𝑟𝑢𝑖 − 𝜇 − 𝑏𝑢 − 𝑏𝑖 − 𝑞𝑖 𝑡 (𝑝𝑢 + 1 √|𝑁𝑢| ∑ 𝑦𝑗𝑗∈𝑁𝑢 )) 2 + 𝜆(𝑏𝑖 2 + 𝑏𝑢 2 + ‖𝑝𝑢‖ 2 + ‖𝑞𝑖‖ 2 + ‖𝑦𝑗‖ 2 ) (12) iJIM ‒ Vol. 15, No. 01, 2021 75 Paper—Two Models Based on Social Relations and SVD++ Method for Recommendation System Where βw is a constant value that is used to determine the influence degrees of the weight values. F(i) denotes the numbers of users that have weights of user i; and W(i,f) refers to the weight value between users i and f. λ is the constant value that determines the degree of the constraint. These equations are utilized to find the best value for vectors 𝑝𝑢 , 𝑞𝑢 , 𝑏𝑢 , 𝑏𝑖 and 𝑦𝑖 vectors. The following equations show the updating values after each loop: 𝑝𝑢 = 𝑝𝑢 + 𝛾( ∂E ∂𝑈𝑢 𝑞𝑖 − 𝜆𝑝𝑢) (13) 𝑞𝑖 = 𝑞𝑖 + 𝛾( ∂E ∂𝑉𝑖 (𝑝𝑢 + 1 √|𝑁𝑢| ∑ 𝑦𝑗𝑗∈𝑁𝑢 ) − 𝜆𝑞𝑖) (14) yj = yj + γ( ∂E ∂Vu ( qi √|Nu| ) − λyj) (15) 𝑏𝑢 = 𝑏𝑢 + 𝛾( ∂E ∂𝑈𝑈 − 𝜆𝑏𝑢) (16) 𝑏𝑖 = 𝑏𝑖 + 𝛾( ∂E ∂𝑉𝑖 − 𝜆𝑏𝑖) (17) Here, 𝛾 is the learning rate. Then, the prediction value is computed according to Eq. 1. Algorithm 1 shows the implementation steps of the weight-SVD++ model. Algorithm 1: Weight-SVD++ model: 1. Read the rating file and create a user-item matrix. 2. Read the social file and create a user-user matrix as explicit relations. 3. Apply the MSRA algorithm using Eqs. (6) and (7) and create a user-user matrix of implicit relations. 4. Compute the similarity between each pair of users who have an explicit relation or an implicit relation using Eq. (10). 5. Find the confidence values of every pair of users using Eq. (8). 6. Calculate the weight values, which combines the similarity values and confidence values using Eq. (9). 7. Initialize random latent factor matrices for vectors p and q. 8. Compute 𝑏𝑖 and 𝑏𝑢 vectors using Eqs. (4) and (5). 9. Compute the regularization square error using Eqs. (11) and (12). 10. Update 𝑝𝑢, 𝑞𝑖, 𝑦𝑖, 𝑏𝑢, 𝑏𝑖 by using Eqs. (13), (14), (15), (16), (17) respectively. 11. Compute the prediction values using Eq. (1). 76 http://www.i-jim.org Paper—Two Models Based on Social Relations and SVD++ Method for Recommendation System Fig. 1. Proposed weight SVD++ model considering explicit and implicit relations and confidence values 2.5 MSRA-SVD++ model The second model involves four sources of information, namely explicit feedback, implicit feedback, explicit relations, and implicit relations. Explicit and implicit feed- back information can be derived from the rating matrix, whereas the explicit relations are available directly in the social file. Implicit relations can be computed using the MSRA algorithm. The difference between the first model and the current model is the utilization of the confidence values. In this model, the similarity values of the implicit and explicit relations are adopted only. In general, the same equations, which are used in the previous model, are applied here with a simple difference in Eq. (11), which is modified as follows: ∂E ∂𝑈𝑢 = ∑ (𝑟𝑢,𝑖 − 𝜇 − 𝑏𝑖 − 𝑏𝑢 − 𝑞𝑖 𝑡 (𝑝𝑢 + 1 √|𝑁𝑢| ∑ 𝑦𝑗𝑗∈𝑁𝑢 )) 2 (𝑢,𝑖)∈𝐾 + 𝜆(𝑏𝑢 2 + 𝑏𝑖 2 + ‖𝑝𝑢‖ 2 + ‖𝑞𝑖‖ 2 + ‖𝑦𝑗‖ 2 ) + 𝛽𝑒 ∑ ∑ 𝑠𝑖𝑚(𝑖,𝑓)‖𝑝𝑖 − 𝑝𝑓‖𝑓𝑟𝑜 2 𝑓𝜖 𝐹(𝑖) 𝑢 𝑖=1 + 𝛽𝑖 ∑ ∑ 𝑠𝑖𝑚(𝑖,𝑓 ∗)‖𝑝𝑖 − 𝑝𝑓∗‖𝑓𝑟𝑜 2 𝑓∗𝜖 𝐹∗(𝑖) 𝑢 𝑖=1 (18) where 𝛽𝑒 and 𝛽𝑖 are constant values greater than zero that used to control the im- pact of the explicit and implicit relationships, respectively, F(i) denotes the numbers of explicit friends and F*(i) indicates implicit relationships of user i; and sim(x,y) is the similarity value between users x and y. λ is the constant value that determines the degree of the constraint. Algorithm 2: MSRA-SVD++ model: 1. Read the rating file and create a user-item matrix 2. Read the social file and create a user-user matrix as explicit relations . . . Ui1 Ui2 Uin i ∈ I Vi u ∈ U Rui Uu . . . Wui Uc1 Uc2 Ucn . . . Ue ∈ Explicit relations Ue1 Ue2 Uen Ui ∈ Implicit relations Uc ∈ confidence users Tu,i1 Tu,i2 Tu,in Tu,e1 Tu,e2 Tu,en Tu,c1 Tu,c1 Tu,cn iJIM ‒ Vol. 15, No. 01, 2021 77 Paper—Two Models Based on Social Relations and SVD++ Method for Recommendation System 3. Apply the MSRA algorithm using Eqs. (6) and (7), then create a user-user matrix of implicit relations 4. Compute the similarity between each pair of users who have an explicit relation or an implicit relation using Eq. (10) 5. Initialize random latent factor matrices for vectors p and q 6. Compute bi and bu vectors using Eqs. (4) and (5) 7. Compute the regularization square error using Eqs. (18) and (12) 8. Update pu, qi, yj, bu, bi by using Eqs. (13), (14), (15), (16), (17) respectively. 9. Compute the prediction values using Eq. (1) Fig. 2. Proposed MSRA-SVD++ model considering explicit and implicit relations 3 Results and Discussion The two detailed implementations of the two models are presented in this section, including the dataset description, evaluation metrics, parameter settings, the bench- marking of the experiments, and the impact of using social relations. 3.1 Dataset description In this study, three standard datasets are used: Last. Fm, Ciao, and FilmTrust. Last. Fm is an online music system that enables users to create their relationships. This dataset was released for the first time in 2011 at the HetRec workshop [21], Last.FM has 2100 users and 18745 items. In this study, users with greater than four relation- ships are included. Thus, the number of users is 1123 and 17892 items. Last. Fm does not have rating values but instead, it has listening counts number and Eq. (19), pro- posed by [22], is applied to map these values from 1 to 5 to present the rating values. . . . Ui1 Ui2 Uin i ∈ I Vi u ∈ U Rui Uu . . . Ue ∈ Explicit relations Ue1 Ue2 Uen Tu,i1 Tu,i2 Tu,in Tu,e1 Tu,e2 Tu,en Ui ∈ Implicit relations 78 http://www.i-jim.org Paper—Two Models Based on Social Relations and SVD++ Method for Recommendation System 𝑟 = { ⌊𝑙𝑜𝑔10𝑙⌋ + 1, 𝑖𝑓⌊𝑙𝑜𝑔10𝑙⌋ + 1 ≤ 5 5,𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒 } (19) Here, r is the rating integer value, 𝑙 denotes the listening count, and ⌊𝑥⌋ denotes the operation of rounding towards zero. The second dataset is Ciao, which was gathered by [23] and involves of 7375 users and 99746 items. The rating value includes integer values 1-5. A subset of the Ciao dataset is utilized in this study to avoid memory consuming in which the items with ratings of less than two are ignored and users that have one or more ratings with at least one relationship are selected. Therefore, the size of the Ciao dataset became 6767 users and 22229 items. Finally, FilmTrust is the third dataset to be applied. It is a movie rating and trust social relations dataset. FilmTrust was produced by [24]. The ratings are float values [0.5 – 4] with 0.5 increments. Table 1 depicts the statistical of these datasets. Table 1. Statistics of the Ciao, FilmTrust, and Last. Fm datasets Dataset Ciao FilmTrust Last.FM # users 6767 1508 1123 # items 22229 2071 18745 # rating 185759 35497 55,140 % Density 0.0012 1.14 0.0026 # explicit relation 111780 1853 11064 #implicit relation using MSRA 610710 3138 177987 3.2 Evaluation metrics Two well-known metrics are utilized to evaluate the performance of the proposed method, namely the Mean Absolute Error (MAE) and the Root Mean Square Error (RMSE), which are defined by the following formulas: 𝑀𝐴𝐸 = 1 𝑁 ∑ |𝑟𝑢,𝑖 − �̅�𝑢,𝑖| 𝑁 𝑖=1 (20) 𝑅𝑀𝑆𝐸 = √ 1 𝑁 ∑ (𝑟𝑢,𝑖 − �̅�𝑢,𝑖) 2𝑁 𝑖=1 (21) where N is the number of the whole prediction, 𝑟𝑢,𝑖 is the real rated value in the us- er-item matrix for user u and item i, and �̅�𝑢,𝑖 is the computed value of the proposed models. High accuracy results mean the values of both metrics are close to zero. 3.3 Experimental setup The determined optimal experimental settings in this study are shown in the fol- lowing Table 2. iJIM ‒ Vol. 15, No. 01, 2021 79 Paper—Two Models Based on Social Relations and SVD++ Method for Recommendation System Table 2. Parameters setup Dataset Ciao FilmTrust Last.FM βe 0.001 0.005 0.002 βi 0.001 0.005 0.002 βw 0.002 0.002 0.002 α 0.006 0.015 0.01 λe and λi 0.0001 0.001 0.001 λ1 and λ2 4,8 7, 10 10, 25 d 5, 10 5, 10 5, 10 No. of Iterations 5 5 5 To evaluate the proposed methods, state-of-the-art approaches are benchmarked against our study, including RSTE [25], PMF [26], SVD++ [27], SocReg [14], TrustMF [28], MSRA-SVD [5], and UE-SVD++ [29]. In the literature, the most similar studies of this study are TrustMF, SocReg, MSRA-SVD, and UE-SVD++. Thus, for equitable comparison, the proposed models were tested under the same conditions, such as the number of iterations, explicit social relations, the dimensional reduction value, and the number of users. Table 2 shows the parameters setup. Two testing views are displayed in this study. The first view in- cludes all users who have at least one relationship (for Ciao and Last.FM) and one rating (for all datasets), the users were distributed into training and testing parts ran- domly. The second view is the Cold-start users, which means users with ratings of less than five were included in the test part for the Ciao and FilmTrust datasets, whereas only three ratings of 20% of all users in the Last. Fm dataset are randomly selected. 3.4 Benchmarking 5-fold Cross-validation was adopted in this experiment. The datasets were divided randomly into five folds in which four-folds had been used for training (80%) and one for testing (20%). To guarantee test all the data, each execution had been performed five times and the average was computed to produce the outcome of each implemen- tation. Moreover, the experiment was implemented several times and the mean was produced to display the eventual result. Tables 3 and 4 depict the accuracy of the results for all users and cold-start users, respectively. The accuracy results were com- puted through the MAE and RMSE metrics. For both metrics, high values indicated low accuracy. On the other hand, if the values had been close to zero, the results would have been highly accurate. Two results were displayed in both tables when the number of dimensions (d) is 5 and 10. The proposed models were compared with state-of-the-art studies, with PMF, SVD++, and UE-SVD++ that used only rated in- formation and the other studies that utilized social relations. All these studies are shown in Tables 3 and 4. Boldface implies the best results and (*) indicates the sec- ond-best results. Tables 3 and 4 show the influences of social relations to improve the prediction process. Accordingly, comparing the results of PMF and SVD++ against TrustMF, 80 http://www.i-jim.org Paper—Two Models Based on Social Relations and SVD++ Method for Recommendation System SocReg, and MSRA-SVD models depict how social relations have a vital rule to boost accuracy. Moreover, the effect of the social relations in cold-start users (Table 4) is greater than that in all users (Table 3). To clarify, cold start users have insuffi- cient rating history to utilize it in the prediction process, whereas for all users, there is reasonable rating history that can be exploited to predict new items for them, and using social relations assist the prediction, but to smaller degree. According to Table 3, the proposed models outperformed all the previous ap- proaches for both models (weight-SVD++ and MSRA-SVD++) and the UE-SVD++ has the closest results in FilmTrust dataset. However, MSRA-SVD has the nearest results to our proposed models in Ciao and Last. Fm datasets. Moreover, MSRA- SVD++ achieved the best performance such that it outperformed our first model (weight-SVD++) in the FilmTrust dataset. Although the weight-SVD++ model uses many sources of information to reduce the sparsity, the accuracy results were lower than the MSRA-SVD++ results. To justify the reason, the weight-SVD++ model uses confidence values as additional information sources to find relations between users. Additionally, dimension reduction methods such as SVD++ achieve good perfor- mance when datasets have acceptable sparsity ratios. In contrast, when the sparsity is reduced to a minimum level (as performed by weight-SVD++), the results will be affected negatively especially with high-density datasets, such as FilmTrust. Nonethe- less, the weight SVD++ model nearly outperformed the MSRA-SVD++ model in terms of MAE and RMSE in the Ciao and Last.FM datasets because these datasets have higher sparsity than FilmTrust, so reducing the sparsity using social relations and confidence values achieved higher accuracy. For example, the improvement that is achieved by the MSRA-SVD++ model comparing with the second-best results in Ciao is about 0.63% and 1.11% for MAE and RMSE respectively when d = 5, and it is about 0.57% for MAE and 1% for RMSE when d = 10. Likewise, the improvement in FilmTrust is approximately 1.4% For MAE and 0.07% for RMSE when d equals 5, Also it is 1.24% and 0.32% for MAE and RMSE respectively when d equals 10. Table 4 presents the results of cold-start users, who have less than five ratings. Ta- ble 4 reveals that MSRA-SVD++ once again exceeded the weight-SVD++ model in FilmTrust. It accomplished the best results in terms of MAE and RMSE, and the im- provement that is achieved by comparing MSRA-SVD++ over the best-second re- sults, when d=10, is about 3.03% and 3.38% for MAE and RMSE in FilmTrust, 3.29% for MAE and 3.83% for RMSE in FilmTrust, and in Last. Fm dataset the im- provement of around 5.29% for MAE and 0.79% for RMSE. Although MSRA-SVD and our present studies are adopted MSRA algorithm to extract implicit relations, the present study has significant improvement in terms of accurate results. Two different reasons caused this improvement. First, adding 𝜆1 and 𝜆2 during the computation of user-bias and item-bias, respectively as shown in Eqs. (4) and (5). Second, the pro- posed method used implicit feedback information source as one of the five sources employed in this study, whereas, MSRA-SVD ignored this source. Additionally, the weight-SVD++ model is superior to MSRA-SVD++ for Ciao and Last.FM datasets in terms of RMSE and MAE such that the improvement that is achieved by weight-SVD++ over MSRA-SVD++ is around 0.03% and 0.04% when d iJIM ‒ Vol. 15, No. 01, 2021 81 Paper—Two Models Based on Social Relations and SVD++ Method for Recommendation System = 5 in terms of MAE and RMSE, respectively in Ciao. The same superiority can be shown in the Last. Fm dataset, which is 0.57% for MAE and 0.14 for RMSE. In summary of the above results, the weight-SVD++ model can achieve better per- formance when the dataset has less density such that the density is 0.0012% and 0.0026% for Ciao and Last.FM respectively. Furthermore, with high-density datasets such as FilmTrust, which is 1.14%, extra sources of information have an adverse impact; thus, the MSRA-SVD++ model produces higher accuracy results than the weight-SVD++ model. Subsequently, for the dataset with high sparsity, the cold-start user problem can be alleviated using social relations with confidence values, which indicates a vital source of information and it has more effectiveness than the dataset with a lower sparsity ratio. Accordingly, the proposed methods enhanced the results in terms of accuracy and overcame the data sparsity and cold-start user problems by exploiting four/five sources of information: explicit social relations, implicit social relations, explicit rating values, implicit rating feedback (for MSRA-SVD++ model), with confidence values (for weight-SVD++ model). All the results confirm that the proposed method succeeded to enhance the recommendation system performance. Table 3. Evaluation performance for all users Methods Metrics Ciao FilmTrust Last FM d=5 d=10 d=5 d=10 d=5 d=10 PMF MAE 0.920 1.078 0.714 0.735 0.427 0.426 RMSE 1.206 0.822 0.949 0.968 0.537 0.535 SVD++ MAE 0.752 0.748 0.613* 0.611* 0.413 0.412 RMSE 1.013 1.001 0.804 0.802* 0.533 0.532 RSTE MAE 0.767 0.763 0.628 0.640 -------- -------- RMSE 1.020 1.013 0.810 0.835 -------- -------- SocReg MAE 0.899 0.815 0.674 0.668 0.422 0.422 RMSE 1.183 1.076 0.878 0.875 0.531 0.530 Trust MF MAE 0.742 0.753 0.631 0.631 -------- -------- RMSE 0.983 1.014 0.810 0.819 -------- -------- MSRA- SVD MAE 0.7274* 0.7263* 0.6214 0.6174 0.3944* 0.3931* RMSE 0.9589* 0.9572* 0.8351 0.8325 0.5182* 0.5177* UE-SVD++ MAE -------- -------- 0.6203 0.6280 -------- -------- RMSE -------- -------- 0.8026* 0.8120 -------- -------- Weight- SVD++ MAE 0.7222 0.7218 0.6052 0.6056 0.3403 0.340 RMSE 0.9474 0.9467 0.8022 0.8017 0.458 0.4575 MSRA- SVD++ MAE 0.7228 0.7221 0.6044 0.6035 0.3405 0.3403 RMSE 0.9482 0.9476 0.802 0.7994 0.4582 0.4579 82 http://www.i-jim.org Paper—Two Models Based on Social Relations and SVD++ Method for Recommendation System Table 4. Evaluation performance for cold-start users Methods Metrics Ciao FilmTrust Last FM d=5 d=10 d=5 d=10 d=5 d=10 PMF MAE 1.033 0.926 0.814 0.767 ---- ---- RMSE 1.334 1.191 1.079 1.009 ---- ---- SVD++ MAE 0.759* 0.749* 0.677 0.680 0.4281 0.4272 RMSE 1.039 1.020 0.897 0.905 0.5572 0.5560 RSTE MAE 0.766 0.764 0.680 0.674* 0.4268 0.4251 RMSE 1.101 1.074 0.884 0.900 0.5691 0.5672 SocReg MAE 1.173 0.949 0.881 0.771 ----- ----- RMSE 1.430 1.214 1.104 1.034 ----- ----- Trust MF MAE 0.752 0.770 0.674* 0.687 ---- ---- RMSE 0.954 1.096 0.867* 0.900 ---- ---- MSRA-SVD MAE 0.7662 0.7658 0.6826 0.6821 0.4233* 0.4228* RMSE 0.988* 0.9873* 0.8967 0.8965* 0.5462* 0.5447* Weight-SVD++ MAE 0.7267 0.7266 0.6628 0.6614 0.4003 0.3994 RMSE 0.9493 0.9489 0.8739 0.8747 0.5417 0.5395 MSRA-SVD++ MAE 0.727 0.7263 0.6526 0.6518 0.4026 0.4004 RMSE 0.9499 0.9494 0.8627 0.8621 0.5425 0.5404 3.5 Impact of social relations Figures 3, 4, and 5 display the results of the RMSE metric of different beta values for the MSRA-SVD++ model when dimension value (d) is 10, for Ciao, Last. Fm, and FilmTrust datasets respectively. Beta values are utilized along with similarity values between users, more specifically beta values control the degree of using social rela- tions to reduce the regularization error during the training phase. High beta values refer to using a high similarity ratio between users who have relationships. In this study, a set of beta values is used, which are (0.0001, 0.001/0.002, 0.01, 0.02, 0.03, 0.04, and 0.05) for Ciao and Last. Fm, and (0.0001, 0.005, 0.1, 0.2, 0.3, 0.4, and 0.5) for FilmTrust. As shown in Figures 3-5, the RMSE results are varied. But in general, high beta values cause low accuracy (high values of RMSE), also small beta values lead to low accuracy. The best beta values to produce the highest accuracy are differ- ent from one dataset to another based on the sparsity ratio and the level of similarity among users. For example, the best beta value, in general, is 0.001 for Figure 3, where the result is 0.9476. In contrast, the best beta value in Figure 4 is 0.002, the result is 0.4575. Likewise, the beta value is different in Figure 5, the best result is reported when beta equals 0.005, the result is 0.7995. iJIM ‒ Vol. 15, No. 01, 2021 83 Paper—Two Models Based on Social Relations and SVD++ Method for Recommendation System Fig. 3. Results of RMSE for different beta values on Ciao Fig. 4. Results of RMSE for different beta values on Last. Fm Fig. 5. Results of RMSE for different beta values on FilmTrust 0.8 0.9 1 1.1 1.2 1.3 1.4 0 0.01 0.02 0.03 0.04 0.05 0.06 R M S E 0.456 0.458 0.46 0.462 0.464 0.466 0.468 0.47 0 0.01 0.02 0.03 0.04 0.05 0.06 R M S E 0.798 0.8 0.802 0.804 0.806 0.808 0.81 0.812 0 0.1 0.2 0.3 0.4 0.5 0.6 R M S E 84 http://www.i-jim.org Paper—Two Models Based on Social Relations and SVD++ Method for Recommendation System 4 Conclusion This paper proposed two models by integrating into the first model: rating infor- mation, which indicates explicit and implicit ratings, social relations that involve explicit and implicit social relations, also confidence values that are computed by measuring the common rating items of each pair of users. Regarding the second mod- el, the rating and social relations were exploited without confidence values. Both models were tested via three standard datasets, which were utilized to evaluate the performance of our approach. The results revealed that the dimension reduction meth- od presents better performance when it works with an acceptable level of density than working with high density (preferring sparsity over density). Additionally, the com- prehensive experimental outcome showed that social relations have further influence when it was applied to cold-start users, who needed extra information to find their preferences. Therefore, the impact of social relations for the cold-start users is more obvious than for those who have reasonable rating history in the same dataset. More- over, the effectiveness of the social relations increases for the low-density ratio da- tasets, where social relations contribute to enriching the dataset with more information that can support the prediction process. For further development, the confidence equa- tion in the weight-SVD++ model can be modified to keep the density ratio at an ac- ceptable level, or applying another similarity equation. Eventually, the clustering model may improve accuracy by separating users into different communities. 5 References [1] J. Jiao, X. Zhang, F. Li, and Y. Wang, “A Novel Learning Rate Function and Its Application on the SVD++ Recommendation Algorithm,” IEEE Access, vol. 8, pp. 14112– 14122, 2019. https://doi.org/10.1109/access.2019.2960523 [2] S. Sarif, Z. Hanafi, S. Farvin, S. Zaibon, and M. Mohsin, “Makan@Local Chatok: Mobile Eatery Recommendation System Based on Local Knowledge,” Int. J. Interact. Mob. Technol., vol. 14, p. 129, Apr. 2020, https://doi.org/10.3991/ijim.v14i05.13357 [3] R. K. Dewi, M. T. Ananta, L. Fanani, K. C. Brata, and N. D. Priandani, “The development of mobile culinary recommendation system based on group decision support system,” Int. J. Interact. Mob. Technol., vol. 12, no. 3, pp. 209–216, 2018. https://doi.org/10.3991/ ijim.v12i3.7799 [4] X. Su and T. M. Khoshgoftaar, “A Survey of Collaborative Filtering Techniques,” Adv. Artif. Intell., vol. 2009, no. Section 3, pp. 1–19, 2009, doi: 10.1155/2009/421425. [5] A. M. A. Al-Sabaawi, H. Karacan, and Y. E. Yenice, “Exploiting implicit social relationships via dimension reduction to improve recommendation system performance,” PLoS One, vol. 15, no. 4, p. e0231457, Apr. 2020, [Online]. Available: https://doi.org/10.1371/journal.pone.0231457. [6] J. Lin, X.-H. Yan, and B. Huang, “Collaborative filtering recommendation algorithm based on SVD and fuzzy clustering,” Comput. Syst. Appl., vol. 25, no. 11, pp. 156–163, 2016. [7] X. Zhou, J. He, G. Huang, and Y. Zhang, “SVD-based incremental approaches for recommender systems,” J. Comput. Syst. Sci., vol. 81, no. 4, pp. 717–733, 2015. https://doi.org/10.1016/j.jcss.2014.11.016 iJIM ‒ Vol. 15, No. 01, 2021 85 https://doi.org/10.1109/access.2019.2960523 https://doi.org/10.3991/ijim.v14i05.13357 https://doi.org/10.3991/ijim.v12i3.7799 https://doi.org/10.3991/ijim.v12i3.7799 https://doi.org/10.1371/journal.pone.0231457 https://doi.org/10.1016/j.jcss.2014.11.016 Paper—Two Models Based on Social Relations and SVD++ Method for Recommendation System [8] Y. Cao, W. Li, and D. Zheng, “An improved neighborhood-aware unified probabilistic matrix factorization recommendation,” Wirel. Pers. Commun., vol. 102, no. 4, pp. 3121– 3140, 2018. https://doi.org/10.1007/s11277-018-5332-2 [9] W. Reafee, N. Salim, and A. Khan, “The power of implicit social relation in rating prediction of social recommender systems,” PLoS One, vol. 11, no. 5, 2016. https://doi.org/10.1371/journal.pone.0154848 [10] . Li, C. Chen, H. Chen, and C. Tong, “Towards context-aware social recommendation via individual trust,” Knowledge-Based Syst., vol. 127, pp. 58–66, 2017. https://doi.org/10. 1016/j.knosys.2017.02.032 [11] U. Liji, Y. Chai, and J. Chen, “Improved personalized recommendation based on user attributes clustering and score matrix filling,” Comput. Stand. Interfaces, vol. 57, pp. 59– 67, 2018. https://doi.org/10.1016/j.csi.2017.11.005 [12] S. Rautela and A. Sharma, “Spirituality and Social Media: Connecting the Dots,” Int. J. Interact. Mob. Technol., vol. 13, p. 81, Sep. 2019, https://doi.org/10.3991/ijim. v13i09.8863 [13] P. Moradi and S. Ahmadian, “A reliability-based recommendation method to improve trust-aware recommender systems,” Expert Syst. Appl., vol. 42, no. 21, pp. 7386–7398, 2015. https://doi.org/10.1016/j.eswa.2015.05.027 [14] H. Ma, D. Zhou, C. Liu, M. R. Lyu, and I. King, “Recommender Systems with Social Regularization Categories and Subject Descriptors,” Proc. fourth ACM Int. Conf. Web search data Min., pp. 287–296, 2011, https://doi.org/10.1145/1935826.1935877 [15] T. Zhou, L. Lü, and Y.-C. Zhang, “Predicting missing links via local information,” Eur. Phys. J. B, vol. 71, no. 4, pp. 623–630, 2009. https://doi.org/10.1140/epjb/e2009-00335-8 [16] Y. Koren, P. Ave, F. Park, H. D. Management, and D. Applications, “Factorization meets the neighborhood: a multifaceted collaborative filtering model,” Proc. 14th ACM SIGKDD Int. Conf. Knowl. Discov. data Min., pp. 426–434, 2008, https://doi.org/10.1145/ 1401890.1401944 [17] F. Ricci, L. Rokach, and B. Shapira, Recommender Systems Handbook, 2nd ed. Springer Publishing Company, Incorporated, 2015. [18] Z. Wu and Y. Li, “Link prediction based on multi-steps resource allocation,” in 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014, vol. 1, pp. 355–360. https://doi.org/10.1109/wi- iat.2014.56 [19] R. Chen, Q. Hua, Y.-S. Chang, B. Wang, L. Zhang, and X. Kong, “A survey of collaborative filtering-based recommender systems: From traditional methods to hybrid methods based on social networks,” IEEE Access, vol. 6, pp. 64301–64320, 2018. https://doi.org/10.1109/access.2018.2877208 [20] P. Moradi, S. Ahmadian, and F. Akhlaghian, “An effective trust-based recommendation method using a novel graph clustering algorithm,” Phys. A Stat. Mech. its Appl., vol. 436, pp. 462–481, 2015. https://doi.org/10.1016/j.physa.2015.05.008 [21] I. Cantador, P. Brusilovsky, and T. Kuflik, “Second workshop on information heterogeneity and fusion in recommender systems (HetRec2011),” in Proceedings of the fifth ACM conference on Recommender systems, 2011, pp. 387–388. https://doi.org/10. 1145/2043932.2044016 [22] J. Liu, C. Wu, and W. Liu, “Bayesian probabilistic matrix factorization with social relations and item contents for recommendation,” Decis. Support Syst., vol. 55, no. 3, pp. 838–850, 2013. https://doi.org/10.1016/j.dss.2013.04.002 [23] J. Tang, H. Gao, H. Liu, and A. Das Sarma, “eTrust: Understanding trust evolution in an online world,” in Proceedings of the 18th ACM SIGKDD international conference on 86 http://www.i-jim.org https://doi.org/10.1007/s11277-018-5332-2 https://doi.org/10.1371/journal.pone.0154848 https://doi.org/10.1016/j.knosys.2017.02.032 https://doi.org/10.1016/j.knosys.2017.02.032 https://doi.org/10.1016/j.csi.2017.11.005 https://doi.org/10.3991/ijim.v13i09.8863 https://doi.org/10.3991/ijim.v13i09.8863 https://doi.org/10.1016/j.eswa.2015.05.027 https://doi.org/10.1145/1935826.1935877 https://doi.org/10.1140/epjb/e2009-00335-8 https://doi.org/10.1145/1401890.1401944 https://doi.org/10.1145/1401890.1401944 https://doi.org/10.1109/wi-iat.2014.56 https://doi.org/10.1109/wi-iat.2014.56 https://doi.org/10.1109/access.2018.2877208 https://doi.org/10.1016/j.physa.2015.05.008 https://doi.org/10.1145/2043932.2044016 https://doi.org/10.1145/2043932.2044016 https://doi.org/10.1016/j.dss.2013.04.002 Paper—Two Models Based on Social Relations and SVD++ Method for Recommendation System Knowledge discovery and data mining, 2012, pp. 253–261. https://doi.org/10. 1145/2339530.2339574 [24] G. Guo, J. Zhang, and N. Yorke-Smith, “A novel bayesian similarity measure for recommender systems,” in Twenty-Third International Joint Conference on Artificial Intelligence, 2013. [25] H. Ma, I. King, and M. R. Lyu, “Learning to recommend with social trust ensemble,” in Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, 2009, pp. 203–210. https://doi.org/10. 1145/1571941.1571978 [26] R. Salakhutdinov and A. Mnih, “Bayesian probabilistic matrix factorization using Markov chain Monte Carlo,” in Proceedings of the 25th international conference on Machine learning, 2008, pp. 880–887. https://doi.org/10.1145/1390156.1390267 [27] Y. Koren and R. Bell, “Advances in collaborative filtering,” in Recommender systems handbook, Springer, 2015, pp. 77–118. https://doi.org/10.1007/978-1-4899-7637-6_3 [28] Y. Bo, L. Yu, L. Dayou, and L. Jiming, “Social collaborative filtering by trust,” in International Joint Conference on Artificial Intelligence AAAI Press, 2013, pp. 2747– 2753. [29] W. Shi, L. Wang, and J. Qin, “User Embedding for Rating Prediction in SVD++-Based Collaborative Filtering,” Symmetry (Basel)., vol. 12, no. 1, p. 121, 2020. https://doi.org/10.3390/sym12010121 6 Authors Ali M. Ahmed Al-Sabaawi is a Ph.D. student of Electrical Electronic and Com- puter Engineering, Aksaray University. He received his MSc degree in computer science, at Universiti Teknologi Malaysia in 2013. His research interested lie in the areas of data mining techniques and machine learning applications. Email: alisabaawi5@gmail.com Hacer Karacan is an Assoc. Prof. of Computer Engineering at the Gazi Universi- ty, Turkey. She received a Ph.D. degree in Cognitive Science from the Middle East Technical University in 2007. Her research interests lie in the areas of artificial intel- ligence, interactive system design, machine learning applications and data clustering. Yusuf E. Yenice is an Assist Prof. of Electrical Electronic and Computer Engi- neering at Aksaray University. He received his MSc and PhD degrees in electronic engineering from University of Surrey, United Kingdom, in 1994 and 1999, respec- tively. His research interest is free-space optical communications and data mining techniques. Article submitted 2020-08-13. Resubmitted 2020-10-04. Final acceptance 2020-10-07. Final version published as submitted by the authors. iJIM ‒ Vol. 15, No. 01, 2021 87 https://doi.org/10.1145/2339530.2339574 https://doi.org/10.1145/2339530.2339574 https://doi.org/10.1145/1571941.1571978 https://doi.org/10.1145/1571941.1571978 https://doi.org/10.1145/1390156.1390267 https://doi.org/10.1007/978-1-4899-7637-6_3 https://doi.org/10.3390/sym12010121 alisabaawi5@gmail.com