Abstract
Link prediction is an important research direction in complex network analysis and has drawn increasing attention from researchers in various fields. So far, a plethora of structural similarity-based methods have been proposed to solve the link prediction problem. To achieve stable performance on different networks, this paper proposes a hybrid similarity model to conduct link prediction. In the proposed model, the Grey Relation Analysis (GRA) approach is employed to integrate four carefully selected similarity indexes, which are designed according to different structural features. In addition, to adaptively estimate the weight for each index based on the observed network structures, a new weight calculation method is presented by considering the distribution of similarity scores. Due to taking separate similarity indexes into account, the proposed method is applicable to multiple different types of network. Experimental results show that the proposed method outperforms other prediction methods in terms of accuracy and stableness on 10 benchmark networks.
Introduction
In the last decade, the problem of link prediction in complex networks has captured growing attention from various disciplines [1–4], owing to not only the incompleteness of many available real-world networks [5, 6], but also its importance in theory and reality [7–9]. In biological networks, link prediction gives help to find potential interactions between proteins [10, 11]. In social networks, it is applied in friend recommendation [12, 13]. In co-authorship networks, link prediction can be used for collaboration prediction [14, 15]. Moreover, in e-commerce networks, it conduces to personalized recommendation of commodity [16, 17].
Link prediction aims to uncover the potential links or point out spurious links, with the known information in networks [1, 18]. Generally speaking, potential links include missing and future links. This paper focuses on missing links. To find missing links, traditional attribute-based methods measure the connecting likelihood of links by using the external features, such as demographic or historical information about specific nodes. These methods take the attitude that individuals tend to form links if there are more common features among them [19]. However, many features of nodes suffer from the inaccessible and unreliable problems due to privacy policy in real scenario [1, 8]. Fortunately, structural similarity-based methods provide a new way to solve the problem. These methods only use observed network structures, such as common neighbors [20–22], paths [23–25], and triangles [26–28]. Therefore, they are not affected by privacy information. Usually, these methods use only one or two features to measure node’s similarity, and assume the features work for all networks. However, different networks tend to have different internal structural characteristics [29, 30]. Therefore, these methods are not robust on different networks.
To address this issue, this paper proposes a new link prediction method based on a hybrid similarity model, in which the technique of Grey Relational Analysis (GRA) [31] is adopted to fuse different similarity features. GRA model was originally developed by Deng [31], and as a part of grey system theory, it is suitable for solving problems with complicated interrelationships between multiple factors and variables [32, 33]. Nowadays, GRA has become a well-known model for multiple-attribute decision-making (MADM) and received growing attention from researchers [34–37]. In this paper, link prediction is regarded as an MADM problem, in which potential links are alternatives and similarity indexes are attributes. The GRA is employed to identify the missing links by solving the MADM problem. To the best of our knowledge, this paper is the first work to apply GRA in link prediction.
In the proposed method, four well-known similarity indexes are carefully chosen as the multiple attributes of GRA, which are LP [20, 23], RA [20], CAR [28], and LNB_AA [38]. The reason that these indexes are selected is they are designed according to different but prominent structural features. By using GRA to fuse different similarity indexes, the weight of each index is necessary. To adaptively assign a weight for each index on various networks, a new weight calculation method is designed in this study by borrowing some idea from the precision-to-noise ratio (PNR) [39], which was proposed to estimate the connection likelihood of potential links according to the distribution of their similarity scores. To verify the prediction performance of the proposed method, this paper experimentally compares it with eight baselines on 10 networks derived from various fields. The experimental results show that the proposed method is superior to other methods in terms of accuracy and robustness.
The main contributions of this work are summarized as follows: To solve the link prediction problem, this work regards it as an MADM problem and uses the technique of GRA to address the problem. A new weight calculation method is proposed to adaptively weigh different attributes in our MADM problem. Extensive experiments executed on 10 benchmark networks manifest that the accuracy and robustness of the proposed method outperforms those of the compared ones.
The remainder of the paper is organized as follows. Section 2 introduces the problem description, evaluation metrics, baselines, and Friedman test. Section 3 shows the proposed method in detail. The experimental results and analysis are given in Section 4. Finally, this work is concluded in Section 5.
Preliminaries
Problem description and evaluation metric
Consider an undirected and unweighted network G (V, E), where V denotes the set of nodes and E describes the set of links. Multi-links and self-loops in G are not allowed. Let N = |V| be the number of nodes in G, and U represent the universal set containing all
(1) AUC can be viewed as the probability that the similarity score of one randomly selected missing link (i.e., a link in E ts ) is higher than that of one randomly selected non-existent link (i.e., a link in U - E). In implementation, among n times of independent comparisons, if there are n′ times that the missing links have higher scores and n″ times that they have the same scores, then the value of AUC can be defined as
(2) Precision only concentrates on the top ranked links within a given prediction list. If l links are correctly predicted when considering top-L links, then Precision is defined as
Heretofore, many link prediction methods have been presented by measuring the similarity of nodes in complex networks. This paper chooses several state-of-the-art methods as baselines for the purpose of performance comparison. The brief description of these methods is listed as follows.
(1) Resource allocation (RA) index [20]. Motivated by the resource assignment process that takes place in networks, RA defines the quantity of resource that one node received from another node through their shared neighbors as their similarity, which is
(2) Local path (LP) index [20, 23]. When computing the similarity between two nodes, this index takes paths with length 2 and 3 connecting them into account. Formally, LP is defined as
(3) Local Naïve Bayes (LNB) method [38]. This method employs the Local Naïve Bayes model to calculate the connection likelihood between two nodes. It defines the likelihood score as
(4) CAR index [28]. This method is derived from both node-based and link-based viewpoints. It suggests that two nodes are more likely to link together if their common neighbors are members of a local-community. CAR index estimates the connection probability of node pair (x, y) as
(5) Adaptive fusion model base on logistic regression (LR) [29]. This method is an adaptive fusion model, which predicts missing links using logistic regression. It is mainly based on the following assumptions: (i) the roles of different structural features are different in networks; (ii) even in the same network, the role of a structural feature in different modules is also different [29]. The connection probability of node pair (x, y) is defined as
(6) Adaptive degree penalization (ADP) index [30]. This method is an adaptive degree penalization link prediction method, which tries to estimate the best-performing degree penalization by using the average clustering coefficient observed in the network. The similarity measure is calculated as
(7) Clustering coefficient for link prediction (CCLP) index [27]. Clustering coefficient of a node reflects the density of links between the neighbors of this node. CCLP index uses the clustering coefficients of shared neighbors to estimate the similarity between two nodes, which is
(8) Mutual information (MI) index [41]. This index evaluates the similarity of nodes from the viewpoint of information theory, which computes the connection likelihood of a link using the conditional self-information between two unconnected nodes with common neighbors. MI index defines the similarity between nodes x and y as
In experiments, the Friedman test [42] is introduced to further reveal the statistical significance of the proposed method. This test is a non-parametric statistical hypothesis test that is used to compare multiple methods based on a group of datasets [43]. Suppose k methods are compared on N datasets, all methods on each dataset are sorted in descending order of accuracy, and assigned the rank sequence 1, 2, ⋯, in turn. In case of ties, average ranks are assigned. Let r
ij
denote the rank of the ith method on the jth dataset, and R
i
be the average rank of the ith method, then
However, the Friedman statistic may be undesirably conservative. Later, Iman and Davenport presented a better statistic [43], which is
To accurately predict missing links, a multitude of similarity indexes have been proposed from different perspectives. Usually, a similarity index, which computes similarity scores of node pairs based on only one or two structural features, assumes the features are applicable to all networks. As a result, its performance is unstable on different networks that have diverse inner structural features. To address this issue, this work proposes a novel link prediction method by fusing multiple similarity indexes with different structural features. To synthesize different indexes, the technique of Grey Relation Analysis (GRA) [31] is adopted. In implementation, each similarity index is treated as an attribute, and each potential missing link is regarded as an alternative. At the same time, four classic similarity indexes, namely LP [20, 23], RA [20], LNB_AA [38] and CAR [28], are employed in the proposed method. For convenience, the proposed method is named LPGRA.
GRA method
Since it was originally developed by Deng [31], the GRA technique has been widely applied in many multiple-attribute decision-making problems [34–37]. As part of grey system theory, GRA is appropriate for solving problems with complicated interrelationships between multiple factors and variables [32, 33]. In the existing literature, there have been many different variants of the GRA method [33, 45–47]. This paper uses a simple and efficient GRA method as in [33, 47], which takes some thoughts from TOPSIS (techniques for order preference by similarity to an ideal solution) [48, 49].
Suppose there are m alternatives and n attributes, the value of the ith alternative under jth attribute is expressed as x
ij
. Then, one can get a decision matrix
Step 1: Normalize the decision matrix
Step 2: Compute the weighted normalized decision matrix
Step 3: Obtain the positive ideal solution S+ and negative ideal (anti-ideal) solution S-, respectively. Both solutions are represented as
Step 4: Calculate the grey relational coefficients. The grey relational coefficient between the ith alternative and the positive ideal solution S+ with respect to the jth attribute is calculated as
Similarly, the grey relational coefficient between the ith alternative and the negative ideal solution S- with respect to the jth attribute is computed as
Step 5: Calculate the grey relational grades. The grey relational grade is the average value of relational coefficients of all attributes, which is an overall evaluation of alternatives. The grey relational grade of the ith alternative from the positive ideal solution is defined as
Step 6: Estimate the relative grey relational grade. For the ith alternative, its relative grey relational grade is computed as
The relative grey relational grade is used to measure the relationship between an alternative and the positive ideal solution. An alternative with higher G i is assumed to be a better solution.
This section describes the LPGRA method in detail. LPGRA considers link prediction as a multi-attribute decision-marking problem (MADM) [50], in which potential missing links are considered as alternatives and similarity indexes are treated as attributes. The process of the proposed method LPGRA is outlined in what follows.
Step 1: Determine the decision matrix.
In this paper, the indexes of LP, RA, LNB_AA and CAR are used to compute the similarity scores for potential links. Suppose there are m unconnected node pairs, according to their similarity scores, the decision matrix
In this step, the normalized decision matrix
Step 3: Obtain the positive ideal and negative ideal solutions.
Since similarity scores of node pairs are the higher the better, all attributes are benefit criteria. The positive ideal solution S+ is composed of the maximum of each similarity index. On the contrast, the negative ideal solution S- is formed from the minimum of each similarity index. Mathematically,
Step 4: Calculate the grey relational coefficients and grades.
For each alternative (i.e., potential missing link), its grey relational coefficients and grey relational grade from the positive ideal solution can be calculated according to Eqs. (24) and (26). On the other hand, the coefficients and grade from the negative ideal solution are computed based on Eqs. (25) and (27).
Step 5: Obtain the relative grey relational grade.
Compute the relative grey relational grades for all potential missing links by using Eq. (28), and then rank them in descending order according to their grades. The links at the top are most likely to be missing ones.
1: Randomly sample 10% links from G as validation set E V ;
2:
3: Compute similarity scores of all node pairs in G by I i ;
4: Divide these scores into H uniform bins with size K;
5:
6: n e = number of existing links in bin h;
7: n n = number of non-existing links in bin h;
8:
9:
10: w i = 0;
11:
12: s e = score of e computed by I i ;
13: Suppose s e locates in bin h;
14: w i = w i + PNR (h);
15:
16:
17:
18:
19:
20:
21:
Weight calculation method
In the proposed LPGRA method, one important thing is to determine the weight of each similarity index. To do that, a new weight calculation method is presented by taking the idea of precision-to-noise ratio (PNR) defined in [39]. The PNR was proposed to estimate the connection likelihood of missing links under different scores by simultaneously analyzing the score distributions of existing and non-existing links.
The proposed weight calculation method measures the weight of each similarity index according to the observed network structure. Algorithm 1 outlines the process of the method. The input contains the observed network G, the set of similarity index
Experimental analysis
This section evaluates the performance of the LPGRA method based on 10 real-world networks compared with eight baselines.
Datasets
To fairly evaluate the accuracy of link prediction methods, 10 real-world networks collected from various fields, including social networks, biological networks and technological networks, are employed in experiments. (1) Jazz [54]: a network of Jazz musicians. (2) USAir [1]: a network of the US air transportation system. (3) Email [55]: an email network between members of a university. (4) Facebook (FBK) [56]: a social network collected from https://www.facebook.com/. (5) Dolphin [57]: a social network of 62 dolphins in a community living off Doubtful Sound, New Zealand. (6) Football [58]: the network of American football games between Division IA colleges during regular season Fall 2000. (7) Polblog [59]: a blogging network about US politics. (8) Infectious (INF) [60]: a network of people’s face-to-face contacts in the exhibition "Infectious: Stay Away" in 2009 at the Science Gallery in Dublin. (9) NetScience (NS) [61]: a co-authorships network between scientists working on network theory and experiment. (10) C. elegans (CE) [51]: the neural network of a Caenorhabditis elegans worm.
In this work, all networks are treated as undirected and unweighted, and only the giant component of each network is used. The basic topological features of the giant components of these networks are listed in Table 1. One can observe from Table 1 that the structural characteristics of these networks are various. For examples, Jazz, FBK and Polblog have high average degrees while Dolphin and NS have low average degrees; Jazz and NS are dense networks, whereas Email and FBK are spare ones.
The basic topological features of these 10 networks. N and M are the total number of nodes and links, respectively. 〈k〉 is the average degree and 〈d〉 is the average shortest path distance. C and r indicate the clustering coefficient [51] and assortative coefficient [52], respectively.
denotes the network density,
is the degree heterogeneity [1], and e is the network efficiency [53]
The basic topological features of these 10 networks. N and M are the total number of nodes and links, respectively. 〈k〉 is the average degree and 〈d〉 is the average shortest path distance. C and r indicate the clustering coefficient [51] and assortative coefficient [52], respectively.
In Algorithm 1, there is a parameter K, which denotes the number of distinct similarity scores in each bin. This experiment determines the optimal value of K under the metric of AUC. Figure 1 shows the values of AUC with changes of K. These results are the average of 50 independent implementations with |E ts |/|E|=0.1. It can be seen from the figure that AUC values have extremely slight fluctuations with the changes of K. That is, the method of LPGRA is not sensitive to K. In the following experiments, the value of K is simply fixed as 1.

AUC values obtained by LPGRA with different values of K.
Table 2 exhibits the predicted accuracy of nine methods under the metric of AUC on 10 networks. The results are the average of 50 independent implementations on each network. In each implementation, a network is randomly partitioned into a training set E tr and a testing set E ts , such that |E tr | : |E ts |=9 : 1. The best accuracy for each network is emphasized by boldface. Apparently, LPGRA achieves the best performance on Email, FBK, Dolphin, Football, INF, and NS, and obtains the second best on Polblog and CE. These results show that LPGRA can get fairly decent accuracy. From Table 1, one can see that Email is a very spare network, therefore common neighbor-based methods get lower accuracy than LP, which benefits from the additional information supplied by length 3 paths [20, 23]. However, by taking advantage of GRA and adaptively ascertaining weights of different indexes, LPGRA attains the best AUC value on Email.
AUC values of different methods on 10 networks. The results are the average of 50 independent implementations with |E
tr
| : |E
ts
|=9 : 1. The best performance for each network is emphasized by boldface
AUC values of different methods on 10 networks. The results are the average of 50 independent implementations with |E tr | : |E ts |=9 : 1. The best performance for each network is emphasized by boldface
In addition, LRm also manifests good predicted results. It obtains three best and two second-best based on AUC. Similar to LPGRA, the reason that LRm can get good performance is it aggregates several structural features. However, other baselines do not always give satisfactory results. Take LP index as an example, on some networks, it is ranked second, but ranked seventh or eighth on some others. This phenomenon is caused by the parameter of LP. On different networks, the optimal parameter is quit diverse [20, 23]. However, it is very time-consuming and impractical to determine the optimal parameter of each network. In a nutshell, the proposed method is more stable than baselines on different networks.
Furthermore, the Friedman test [43] is employed to analyze the significant differences between baselines and LPGRA based on the above AUC results. According to Table 2, we get
Next, the changes of AUC of all prediction methods with different proportions of training set E tr in E (from 0.7 to 0.9) is shown in Fig. 3. Evidently, with the proportion increasing from 0.7 to 0.9, the AUC scores show an upward trend. This phenomenon is easy to understand. Increasing the proportion of training set will provide more training information. Conversely, the lower the E tr ratio, the more difficult the link prediction is. As a result, experiments with lower proportions of E tr , such as 0.6 and 0.5, are not further enumerate. More importantly, Fig. 3 presents that the AUC values of the proposed method are either the best or close to the best on all networks. Figure 3(k) depicts the average ranks of different methods under varying proportions of testing sets on all networks, which shows that the average ranks of the proposed method are always the best. Additionally, the significant differences for |E tr |/|E| = 0.8 and 0.7 are analyzed separately. The corresponding values of F F are 20.605 and 20.095, both of which are greater than 2.070 (the critical value of F (8, 72)). That implies that these methods are not equivalent. The results of the Bonferroni-Dunn test are graphically shown in Fig. 4, which again prove the best performance of the proposed method.

Comparison of LPGRA against the others with the Bonferroni-Dunn test. This comparison is based on the results in Table 2. All methods with ranks outside the marked interval are significantly different from LPGRA.

AUC results on 10 networks with different proportions of training set E tr .

The Bonferroni-Dunn test for |E tr |/|E| = 0.8 and 0.7. All methods with ranks outside the marked interval are significantly different from LPGRA.
The comparison of different link prediction methods under the metric of Precision on these 10 networks with different sizes of L is shown in Fig. 5. Unlike AUC, which quantifies the accuracy from the entirety, Precision concerns the prediction accuracy of the top-L predicted links. These results in Fig. 5 prove that the performance of LPGRA is invariably at the forefront on most networks. Nevertheless, the performance of baselines fluctuates wildly across different networks. For example, the precision of RA is the best on NS, but the last on Polblog. In addition, one can observe that with the increase of L, Precision scores of most methods tend to decline. The reason is that the increase of L will reduce the probability to uncover relevant items, and then the value of Precision will be lower.

Precision of different methods on 10 networks with different values of L. The results are the average of 50 independent implementations with |E ts |/|E|=0.1. The size of E ts for Dolphin is 15, so the max L selected is 15. Similarly, the max L selected for Football is 60.
Finally, the changes of Precision with different training set ratios (|E tr |/|E| = 0.7 to 0.9) are depicted in Fig. 6. Here, L = |E ts | for all networks. It can be seen from Fig. 6 that the trend of Precision is different, even opposite, to AUC. When the proportion of training set increases from 0.7 to 0.9, Precision scores present a gradual decline. This scenario was explained in [62]. For the calculation of AUC (see Eq. (1)), the decrease of training set will result in weak n′ and strong n″. As a result, the value of AUC will be lowered [62]. On the other hand, the decrease of training set means increase of testing set. Correspondingly, the probability of getting relevant items will also increase, which causes more missing links to be revealed [62].

Precision results on 10 networks with different proportions of training set E tr .
Overall, the performance of the proposed method is superior to baselines, and thence it is applicable to more networks compared with baselines. The remarkable characteristic of LPGRA is that it integrates multiple structural features of a network via the technique of GRA. By using the weight calculation method, LPGRA is able to automatically adapt to different networks and maintain stable performance.
This paper proposed a new link prediction method LPGRA, which aggregates the results of several similarity indexes. LPGRA regards link prediction as an MADM problem, in which potential links are alternatives and similarity indexes are attributes. The Grey Relation Analysis, a well-known MADM method, is adopted in the proposed method to rank potential links by solving the MADM problem. In the proposed method, to fuse different indexes, the weight of each index is necessary. To this end, a new weight calculation method was designed, which can adaptively assign weight scores for all indexes only according to the observed network structures. The accuracy and stableness of the proposed method were experimentally investigated on 10 benchmarks under the metrics of AUC and Precision. The experimental results demonstrate that the proposed method is superior to baselines in terms of accuracy and stableness. In addition, experiment analysis implies that hybrid similarity model is a feasible way to solve the link prediction problem.
Footnotes
Acknowledgments
This work was supported in part by the National Natural Science Foundation of China (No. 61602225) and the Fundamental Research Funds for the Central Universities (No. lzujbky-2019-90).
