Link strength diffusion model for influence diffusion in weighted social networks

Abstract

Diffusion models are critical for studying the spread of influence in social networks, as they help estimate the potential reach of seed nodes, allowing for better evaluation of their goodness and effectiveness. This research aims to develop a diffusion model for weighted social networks, addressing a gap in traditional models that typically consider only the binary state of relationships between nodes. While existing models treat edges as simply existing or not, real-world social networks are inherently weighted, with edge weights reflecting the strength of the link between the connected nodes. To better capture these dynamics, we propose the Link Strength Diffusion (LiSt-D) model, which uses the strength of the shared relation to determine the probability of diffusion between social actors. The LiSt-D model was tested on three real-world weighted social networks, Bitcoin Alpha, Bitcoin OTC, and Advogato, using seven heuristics to select the initial seed nodes. The findings showed that diffusion spread under LiSt-D covered 90% of the Bitcoin Alpha network, 84% of Bitcoin OTC, and 63% of the Advogato network. The LiSt-D model captures the heterogeneity in the strength of the links connecting nodes, and demonstrates early peak performance with high stability.

Keywords

Diffusion model; Weighted social network; Link strength; Influence; Edge weight

1 Introduction

Online social networks play a major role in information dissemination and social communication today, transforming the way people connect, interact, and form opinions. Real-world tasks pertaining to social awareness, opinion monitoring, viral marketing, and political campaigning frequently rely on online social networks for information dissemination and opinion shaping.^1–8 Viral marketing leverages the organic spread of information within social networks, capitalizing on word-of-mouth dynamics.^9–12 Free samples are given to select individuals deemed influential, in the belief that they will endorse the product to their connections, who in turn pass it on to their networks, and so on. Driven by viral marketing, influence maximization (IM) has come under the research spotlight, with works focussing on the diffusion of information, innovations and views across social networks. Initially explored in probabilistic contexts^9,11 and later transformed into an optimization task,¹³ IM addresses the challenge of selecting k initiators (or nodes) that can assist in “maximizing the spread of influence through relationships in the social network.¹⁴” However, the goodness (efficacy) of these k initiators, which is the selection criterion, is always evaluated based on their anticipated influence spread as simulated using a diffusion model. Existing works have established that IM rests heavily on the underlying diffusion model employed, as the influence coverage of the selected k initiators is always considered with respect to a certain diffusion model.^2,14–17 Some popular diffusion models often used to study diffusion in social networks are Susceptible-Infected-Recovered (SIR),¹⁸ Independent Cascade (IC),¹⁴ IC-Weighted (IC-W),¹⁴ Linear Threshold (LT),¹⁴ and Trivalency (TV).¹⁹

The majority of existing works in this area focus on unweighted social networks, overlooking the fact that most real-world social networks are weighted, like friendship networks, rating networks, collaboration networks, trust networks etc.^20,21 In a weighted social network, edges have values associated with them, which signify the strength of the link (relationship) connecting a pair of nodes.²² Edge weight can have different interpretations: in friendship networks it may represent the strength of the bond shared between friends, while in rating networks it may depict the trust level between the connected users. It has been argued that “a social tie's strength is often a function of its duration, emotional intensity, intimacy, and exchange of services²³” and that social influence is contingent on factors like relationship strength, user characteristics, network properties, distance between users, and temporal aspects.^24–26 Considering relationships in terms of their emotional intensity and intimacy has been found to be socially meaningful.²⁷ In real life, people have various types of relationships or linkages with others, such as acquaintances, business contacts, or best friends, and not all carry the same significance. Diverse interaction patterns can be observed in real-world social networks, based on the intensity of the relation shared by the connecting nodes. Weighting captures this diversity by quantifying the link strength between individuals.^28–30 Further, most social relationships are directed, and the intensity of the relation may differ between the two directions. Thus, a (directed) weighted network can be considered to aptly represent a real-life social network. In a (directed) weighted social network, the link strength is read in the direction from the sender to the receiver, i.e., relationship strength is assessed from the sender's perspective.

Furthermore, most existing works have focused on developing algorithms for initiator (seed node) identification, while much less emphasis has been placed on developing diffusion models. Most existing studies involving weighted social networks use the IC¹⁴ and SIR¹⁸ models as the underlying diffusion mechanism, both of which assume uniform infection (influencing) probabilities for all links in the network and consider only the presence or absence of a link between the connecting nodes,²² without taking the heterogeneity of link strength into consideration. However, we believe that when working on influence diffusion in weighted social networks, link strength should be incorporated in the underlying diffusion mechanism.

Research Highlights: In our research, we propose Link Strength-Diffusion (LiSt-D), a diffusion model for (directed) weighted social networks which accounts for link strength (depicted by edge weights) when computing the diffusion probability. Link strength generally varies between each pair of social actors, with some being strong, and some weak. The proposed LiSt-D model takes into consideration the impact of the heterogeneity in relations that exist between two users, as seen from the sender's perspective. Mentioned below are the significant contributions of this study:

A diffusion model for (directed) weighted social networks, Link Strength Diffusion (LiSt-D), which incorporates relation strength in the computation of diffusion probability between two social actors.

The proposed LiSt-D model considers the link strength between social actors which introduces a variation in the influence diffusion probability between the two nodes, which closely resembles the social behaviour of people in real life.

Simulation of diffusion using the proposed LiSt-D model is conducted on three real-world weighted social networks and evaluated with regard to the total influence spread.

To the best of our knowledge, the presented work is among the preliminary studies focussing on the development of a model for diffusion in (directed) weighted social networks, wherein the likelihood of diffusion from source to target node is based on the strength of the link connecting them.

The rest of this paper is organized as follows: in Section 2, we delve into discussions on research concerning the diffusion of influence in social networks. Section 3 offers a brief overview of fundamental concepts relevant to our study. Our research motivation is outlined in Section 4, followed by the introduction of the LiSt-D model for influence diffusion in (directed) weighted social networks. Section 5 is dedicated to detailing the experimental setup and analysing the obtained results. To end, inferences are drawn in Section 6.

2 Related work

Numerous research works have tackled the task of increasing the dispersion of information, influence, and perspectives within social networks. The general problem of IM in social networks is about identifying a small set of seed nodes that serve as the first information adopters and help start a diffusion process. Many approaches for IM in social networks rely on centrality measures, like degree, betweenness, h-index, k-core, etc., that exploit the topological features of the network. Centrality measures combined with community detection have also been explored to enhance seed node identification whilst considering both local and global characteristics.^31,32 The HybridRank approach uses k-shell decomposition centrality to compute the coreness of a node and combines it with the eigenvector centrality of the node.³³ Liu & Zheng have proposed an approach that assesses nodes’ significance based on the concept of extended degree, which combines a node's degree with the degree of its 1-order neighbors.³⁴ The local and global influence method determines a node's capacity for dissemination depending on its local location, computed by combining the node's degree with its clustering coefficient, and its global location, computed using k-shell decomposition.³⁵ The Heuristic Independent Path Algorithm uses a vertex cover algorithm which helps minimize computational requirements by discarding irrelevant nodes from the selected domain.³⁶ The model proposed by Xu et al. first learns influence probabilities, and then transforms the problem into a weighted maximum cut problem which determines seed nodes by analysing the influence flow among nodes.³⁷ The Group of Influential Nodes algorithm spots nodes with a higher number of common neighbors and communication links by constructing different sub-graphs so as to expand the search area and reduce the amount of computation.³⁸ Ge et al. have proposed two models for diffusion that highlight the significance of multiple topics in seed node selection.³⁹ Features like users’ spatiotemporal behaviour and community structure characteristics have also been explored for seed selection.^40,41 A prediction and replacement approach, based on the anticipated outcome, for mining seed nodes in a dynamic complex network first predicts the forthcoming network snapshot using information from previous network snapshots, and then uses a quick replacement approach to mine the seed nodes between snapshots.⁴² Aspects of a node's temporal behavior, like interaction frequency and self-similarity in interaction pattern, have also been exploited to study their role in the quick and widespread dissemination of information across social networks.^43,44

Bellingeri et al. reviewed studies that adopt a weighted network approach towards real-world social networks, emphasizing that edge weights capture interaction heterogeneity, making binary link representations insufficient.²² WVoteRank⁴⁵ extends VoteRank,⁴⁶ which determines seed nodes in unweighted networks using a voting scheme, by taking edge weights into account. WVoteRank computes each node's voting score whilst considering its 1-hop neighbors. Raamakirtinan and Livingston developed a weighted vote ranking approach with a dynamic weight parameter control that incorporates vote, weight, and coreness to locate influential nodes.⁴⁷ The Weighted Mixed Degree Decomposition technique for identifying influential nodes considers the underlying relationship strength, $k$-shell value and exhausted degree.⁴⁸ Yang et al. have presented a remaining minimum degree decomposition method wherein the weighted degree of nodes is first computed, and then minimum degree nodes are repeatedly eliminated.⁴⁹ The HookeRank approach employs Hooke's law of elasticity to find influential spreaders in weighted social networks, by modelling edge weights as spring constants and then assessing the propagation path between the nodes.⁵⁰ The performance of four popular diffusion models has been explored and analysed with regard to IM in weighted social networks.¹⁹ The impact of trustworthiness among social network users on the diffusion of news in the network has been investigated in ref.⁵¹ The authors made use of a credibility network, which is an additional weighted directed layer whose weights change depending on the spreading process. Furthermore, an information spreading model based on edge-weight-based compartmental theory has been developed for understanding the inherent mechanism of information spreading in weighted multiplex networks.⁵² The Weighted Artificial Bee Colony algorithm is a bio-inspired approach that evaluates the fitness value of each node by resolving the reachability problem centred on the paths of maximum probability to spot seed nodes.⁵³ The weighted $k$-shell degree neighborhood indexing approach combines $k$-shell and degree through adjustable settings to select the most powerful spreaders.⁵⁴ Liu et al. suggested a ranking system based on the network's assortativity value for locating key spreaders, wherein a node's spreading ability is assessed using its degree and capacity to spread out.⁵⁵ Evidential $k$-shell centrality based on prospective edge weight combines modified evidential centrality with consideration for the degree distribution and the layer of nodes situated in networks.⁵⁶ Potential edge weight based $k$-shell degree neighborhood centrality assigns probable edge weights to the edges between nodes by using node degree, a derived network parameter, and $k$-shell index.⁵⁷ Table 1 presents a summary of the contributions of existing works pertaining to weighted social networks.

Table 1.
Summary of literature review for weighted social networks.

Reference	Research Contribution: Seed Selection Algorithm	Research Contribution: Diffusion Model	Connection Strength Considered	Existing Diffusion Model Used
⁵	IC-P Greedy	IC-P	No	IC, IC-W, TV
⁷	R-Greedy with Live-edge and Propagation-path	LT-S	No	IC-W, TV
⁴⁴	Improved Weighted Vote Rank	-	Yes	SIR
⁴⁶	Weighted Vote Ranking with Weighted Mixed Degree Decomposition	-	Yes	SIR
⁴⁷	Extended Weighted Mixed Degree Decomposition	-	Yes	SIR
⁴⁸	Extended Weighted Degree based on Remaining Minimum Degree Decomposition	-	Yes	SIR
⁴⁹	HookeRank	-	Yes	SIR
⁵⁰	-	-	No	SIR, IC, IC-W, TV
⁵³	Weighted Artificial Bee Colony	-	Yes	-
⁵⁴	Weighted $k$-shell Degree Neighborhood Indexing	-	No	SIR
⁵⁵	Ranking based on Tuning Weight Parameter	-	No	SIR
⁵⁶	Evidential $k$-shell Centrality based on Potential Edge Weight	-	Yes	SI
⁵⁷	Potential Edge Weight based $k$-shell Degree Neighborhood Centrality	-	Yes	SIR

A review of the existing literature shows that influence spread in weighted social networks has been studied to a limited extent compared to unweighted social networks. Moreover, most of the work focusses on developing algorithms for identifying better seed nodes. The existing literature concentrates to a much lesser extent on the diffusion approach that defines the criteria for node activation. As stated above, the diffusion models widely employed for simulating diffusion in weighted social networks only consider the existence or non-existence of a link. Though it has been established that user interactions are more often than not influenced by the strength of the relation shared by the actors, most studies have not taken this aspect into consideration. Motivated by these findings, we propose the Link Strength Diffusion (LiSt-D) model for influence diffusion in (directed) weighted social networks, which takes into consideration the impact of the heterogeneity in relations that exists between two users.

3 Preliminaries

This section presents the working of the prevalent models used for studying diffusion in weighted social networks, namely Susceptible-Infected-Recovered (SIR) model,¹⁸ Independent Cascade (IC) model¹⁴ and IC-Weighted (IC-W) model.¹⁴

3.1 Network diffusion models

Study of existing literature (Table 1) pertaining to weighted social networks establishes SIR to be the most commonly and widely employed diffusion model. Further, IC and IC-W come a close second. This section presents the working of these three diffusion models.

3.1.1 Susceptible-infected-recovered model

In the SIR model, the nodes are classified into three groups: Susceptible (S), Infected (I), and Recovered (R). Susceptible nodes can receive information from their neighbouring infected nodes. As per the SIR model, hosts become infected, remain infected for a while, and then recover. After a host recovers, it is no longer susceptible to the infection. To begin with, all nodes except the seed nodes are in the susceptible state. At each time step, the susceptible neighbours of the infected nodes become infected with probability β. Infected nodes move into the recovered state with probability γ, after which they are immune to further infection.
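The SIR dynamics described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the implementation used in the experiments: the function name `simulate_sir`, the adjacency format (`successors`, a dict mapping each node to its outgoing neighbours), and the `max_steps` cap are all hypothetical choices made for this example.

```python
import random


def simulate_sir(successors, seeds, beta, gamma, max_steps, rng=None):
    """Discrete-time SIR sketch on a directed graph (illustrative only).

    successors: dict mapping node -> iterable of outgoing neighbours (assumed format)
    seeds:      initially infected nodes; every other node starts susceptible
    beta:       per-step infection probability along an edge
    gamma:      per-step recovery probability of an infected node
    """
    rng = rng or random.Random()
    infected = set(seeds)
    recovered = set()
    for _ in range(max_steps):
        if not infected:
            break  # nothing left to spread
        newly_infected = set()
        for u in infected:
            for v in successors.get(u, ()):
                susceptible = v not in infected and v not in recovered
                if susceptible and rng.random() < beta:
                    newly_infected.add(v)
        # Nodes that were infected at the start of the step may recover.
        newly_recovered = {u for u in infected if rng.random() < gamma}
        infected = (infected | newly_infected) - newly_recovered
        recovered |= newly_recovered
    return infected, recovered
```

With `beta = 1.0` and `gamma = 1.0` on a simple chain, the infection passes one hop per step and every node ends up recovered, which is a convenient sanity check before running randomized trials.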

3.1.2 Independent cascade & independent cascade-weighted models

In both IC and IC-W models, diffusion is contingent upon the likelihood of information propagating between the sender and the recipient. The following basic assumptions guide how these models function:

The underlying network is directed.

Nodes signify actors, whilst edges depict relationships between them.

Information can only be disseminated by a node to its outgoing neighbours.

State of a node can be either active (influenced), indicating adoption of diffused information, or inactive (uninfluenced).

Only an activated node can further activate its successors (aka outgoing neighbours).

Each activated node gets a single chance in which it can attempt to activate its inactive successors.

As the model is progressive, a node that has been activated cannot deactivate, and continues to maintain state until diffusion culminates.

Diffusion is carried out in discrete time steps.

The basis of the IC and IC-W models is the belief that an uninfluenced node's chances of getting influenced improve as more of its inbound neighbours become active. Diffusion initiates at timestep $t$ with an initial set of active nodes $(S)$. Every node $u$ that became active at timestep $t$ makes a probability-based effort (say, $p_{u, v}$) to activate each of its dormant successors $v$ at timestep $t + 1$. The propagation probability, $p_{u, v}$, must be calculated or known before the diffusion process begins. $p_{u, v}$ serves as the node activation criterion and represents the likelihood of information spreading from node $u$ to node $v$. At each timestep, a random number $(r)$ is generated for every activation attempt made by the nodes activated in the preceding timestep, and is compared to $p_{u, v}$. If $r \leq p_{u, v}$, node $v$ becomes active and information spreads across edge $(u, v)$. If $r > p_{u, v}$, there is no activation, and $v$ stays in the inactive state until an attempt by another active predecessor succeeds. Diffusion continues until no more activations are feasible. Although the fundamental operating principle of IC and IC-W is the same, $p_{u, v}$ is computed differently.

IC model: The research community generally assigns a uniform propagation probability to all edges, since there is no widely accepted standard method for determining the value of $p_{u, v}$.

IC-W model: The value of $p_{u, v}$ here is equal to the reciprocal of the in-degree of node $v$ (Eq. 6). As a result, the value of $p_{u, v}$ varies throughout the network.

\begin{aligned} p_{u, v} = \frac{1}{\text{in\_degree}(v)} \end{aligned}

(6)
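To make the difference between the two models concrete, the cascade procedure and the IC-W probability of Eq. 6 can be sketched as follows. This is an illustrative sketch only: the names `icw_probabilities` and `independent_cascade`, the adjacency format (a dict of successor lists with no duplicate edges), and the use of a callable `prob(u, v)` to supply $p_{u, v}$ are assumptions introduced here, not part of the original models' specification.

```python
import random


def icw_probabilities(successors):
    """Return p[(u, v)] = 1 / in_degree(v) for every directed edge (u, v)."""
    in_degree = {}
    for outs in successors.values():
        for v in outs:
            in_degree[v] = in_degree.get(v, 0) + 1
    return {(u, v): 1.0 / in_degree[v]
            for u, outs in successors.items() for v in outs}


def independent_cascade(successors, seeds, prob, rng=None):
    """Run one cascade; prob(u, v) supplies the propagation probability."""
    rng = rng or random.Random()
    active = set(seeds)
    frontier = list(seeds)  # nodes activated in the previous timestep
    while frontier:
        next_frontier = []
        for u in frontier:  # each newly active node gets a single chance
            for v in successors.get(u, ()):
                if v not in active and rng.random() <= prob(u, v):
                    active.add(v)
                    next_frontier.append(v)
        frontier = next_frontier
    return active


# Example usage on a toy graph (hypothetical data).
graph = {"a": ["c"], "b": ["c"], "c": ["d"]}
p_icw = icw_probabilities(graph)
# Uniform IC: the same probability on every edge (0.1 is an arbitrary choice).
spread_ic = independent_cascade(graph, ["a"], lambda u, v: 0.1)
# IC-W: probability depends on the in-degree of the target node.
spread_icw = independent_cascade(graph, ["a"], lambda u, v: p_icw[(u, v)])
```

Passing the probability as a callable keeps the cascade loop identical for both models, so only the way $p_{u, v}$ is obtained changes between IC and IC-W.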

4 Proposed link strength diffusion model

Real-world social networks are often weighted: a weight is associated with each link and represents the heterogeneity in the strength of the relation (link) between the connected users.²⁰ Further, the manner in which information (influence) diffuses from a node to its connections strongly affects the spread attained.^2,18 Drawing motivation from this notion, our work focuses on developing a diffusion model called Link Strength Diffusion (LiSt-D) for (directed) weighted social networks, which incorporates the heterogeneity in the strength of the link between social actors when computing the influence diffusion probability. In practice, link strength generally varies between each pair of social actors, with some links being strong and some weak. Since a higher edge weight signifies greater link strength, the chances of communication and subsequent sharing of information over such an edge are higher than over an edge with a lower edge weight. In the proposed LiSt-D model, link strength is read in the direction from the sender to the receiver, i.e., relationship strength is assessed from the sender's perspective. The LiSt-D model is probabilistic and progressive, and assumes the social network to be directed.

Algorithm 1 outlines the approach of the proposed LiSt-D model. The model takes a weighted social network $(P)$ as input along with a list of seed nodes $(Δ)$ that serve as initiators of the diffusion process. A weighted social network can be conceptualized as a graph $P = (N, L, w)$ , where N denotes the node set (users), L denotes the edge set, and w signifies a weight function that maps L to the set of real numbers. Output is the set of nodes that get influenced (activated) at the culmination of the diffusion process. The Propagation Probability (PP) for an edge connecting node $n_{i}$ and node $n_{j}$ is computed as per equation (10),

\begin{aligned} PP_{n_{i}, n_{j}} = \frac{w_{n_{i}, n_{j}}}{\sum_{n_{s} \in Out_{n_{i}}} w_{n_{i}, n_{s}}} \end{aligned}

(10)

where $w_{n_{i}, n_{j}}$ is the weight on the edge from node $n_{i}$ to node $n_{j}$, and $Out_{n_{i}}$ represents the set of successors (outgoing neighbors) of $n_{i}$. Post the computation of edge propagation probabilities, the diffusion process initiates with an initial set of seed nodes $(Δ)$. The diffusion process unfolds in discrete time steps, often referred to as cascades. The seed nodes are considered to be active at the initial time step, hence $I_{G}$ is initialized with $Δ$ at $t = 0$. During each subsequent cascade, a newly influenced node $δ_{k} \in I_{G}$ attempts activation of its inactive successor $s_{m}$ (such that $s_{m} \notin I_{G}$), with the probability $PP_{δ_{k}, s_{m}}$. Every newly influenced node gets only one chance of influencing (activating) its yet uninfluenced successors. If an influenced node $(δ_{k})$ succeeds in its effort, its uninfluenced successor $(s_{m})$ changes state and becomes influenced, and gets included in the set $I_{G}$, thereby becoming eligible to further influence its uninfluenced successors. For activation, a random number $(r)$ in $[0, 1]$ gets computed for every combination of $(δ_{k}, s_{m})$. If $r \leq PP_{δ_{k}, s_{m}}$, then node $s_{m}$ transitions state from uninfluenced to influenced and gets included in the set $I_{G}$, else $s_{m}$ continues to remain uninfluenced. This process continues iteratively, with cascades progressing until no new activations occur in a given step. The diffusion terminates when a cascade results in no additional nodes being activated, indicating that the influence can no longer spread further through the network.
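The procedure described above can be illustrated with a short Python sketch. It is not a reproduction of Algorithm 1: the function names `propagation_probabilities` and `list_d`, the nested-dict weight format, and the handling of a node whose outgoing weights sum to zero (its edges get probability 0) are assumptions made for this example.

```python
import random


def propagation_probabilities(weights):
    """Compute PP per equation (10): edge weight / sum of the sender's out-weights.

    weights: dict mapping node -> dict of successor -> non-negative edge weight
             (assumed input format).
    """
    pp = {}
    for u, outs in weights.items():
        total = sum(outs.values())
        for v, w in outs.items():
            # Assumption: if all outgoing weights are zero, PP is set to 0.
            pp[(u, v)] = w / total if total > 0 else 0.0
    return pp


def list_d(weights, seeds, rng=None):
    """Run one LiSt-D style diffusion and return the set of influenced nodes."""
    rng = rng or random.Random()
    pp = propagation_probabilities(weights)
    influenced = set(seeds)   # I_G, initialised with the seed set
    frontier = list(seeds)    # nodes influenced in the previous cascade
    while frontier:           # stop when a cascade activates nobody
        newly_influenced = []
        for u in frontier:    # each newly influenced node tries once
            for v in weights.get(u, {}):
                if v not in influenced and rng.random() <= pp[(u, v)]:
                    influenced.add(v)
                    newly_influenced.append(v)
        frontier = newly_influenced
    return influenced


# Toy example (hypothetical weights): node "a" has a strong tie to "b"
# and a weak tie to "c".
toy = {"a": {"b": 3.0, "c": 1.0}, "b": {"d": 2.0}}
toy_pp = propagation_probabilities(toy)
```

In the toy network, the edge from "a" to "b" receives PP 3/4 and the edge from "a" to "c" receives PP 1/4, so the probabilities on a sender's outgoing edges sum to 1 and stronger ties are proportionally more likely to carry the influence.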

Table 2 lists the symbols and their meaning as used in Algorithm 1.

Table 2.

List of symbols used in Algorithm 1.

Symbol	Description
$P$	Weighted social network
$N$	Set of nodes in $P$
$L$	Set of edges in $P$
$w$	Set of edge-weights (representing link-strength)
$Δ$	Set of seed nodes (initiators)
$Out_{n_{i}}$	Set of successors of node $n_{i}$
$W_{n_{i}}$	Sum of weights assigned to all edges originating from node $n_{i}$
$w_{n_{i}, n_{j}}$	Weight associated with edge connecting node $n_{i}$ and node $n_{j}$
$EPP_{n_{i}, n_{j}}$	Edge propagation probability for edge $(n_{i}, n_{j})$, written $PP_{n_{i}, n_{j}}$ in equation (10)
$I_{G}$	Set of influenced nodes
$L_{I_{G}}$	Number of influenced nodes
$r$	Random number between 0 and 1
$\cup$	Set union operator

4.1 Time complexity analysis

The time complexity of the proposed LiSt-D model comprises two steps: (i) PP assignment for every edge, and (ii) influence spread of each influenced (active) node. Here $N$ denotes the number of nodes. In step (i), for each node, the weights assigned to the edges between the node and its successor(s) are considered. Assuming a network with a total of $N$ nodes in which each node can have up to $(N - 1)$ successors, initializing the PP for all edges takes $N$ nodes $\times$ $(N - 1)$ successors, i.e., $O (N \cdot (N - 1)) = O (N^{2})$ time. In step (ii), at each time step, every newly influenced node attempts to influence its uninfluenced successors. Suppose the process takes $T$ time steps until termination. At any step, if $I$ nodes are in the active state, then at most $(N - I)$ successors remain uninfluenced, leading to up to $O (I \cdot (N - I))$ attempts in that step. Summing across all $T$ steps bounds the overall cost of this phase. Since across the entire diffusion each of the $N$ nodes can attempt activation of at most all other $(N - 1)$ nodes, the total number of influence-attempt operations over all steps is bounded by $O (N \cdot (N - 1)) = O (N^{2})$. To summarize, the overall computational time complexity of the LiSt-D model is the sum of the time required for steps (i) and (ii), i.e., $O (N^{2} + N^{2}) = O (N^{2})$.
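The bound for step (ii) can be written out as a short counting argument. The following is a sketch under the assumptions already stated for the model (each influenced node attempts each of its successors once, and the network has no self-loops or duplicate edges). The symbol $A$, denoting the set of nodes that are ever influenced, is introduced here purely for illustration, and $| L |$ is the number of edges.

```latex
\begin{aligned}
\text{attempts in step (ii)}
  \;\leq\; \sum_{n_{i} \in A} | Out_{n_{i}} |
  \;\leq\; \sum_{n_{i}} | Out_{n_{i}} |
  \;=\; | L |
  \;\leq\; N \cdot (N - 1)
\end{aligned}
```

Read this way, the $O (N^{2})$ figure is a worst case reached only by a complete directed network, and for sparse networks the number of attempts is limited by the number of edges.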

5 Experiments and discussion

The proposed LiSt-D model was tested by using it to simulate influence diffusion in three real-world weighted social networks and comparing its performance with other prevalent models. The experiments were implemented in Python on Google Colaboratory (Colab).

5.1 Dataset description

Three real-world weighted social networks of varying sizes and topological features were used to simulate the diffusion process. All selected networks are directed and follow power-law degree distributions. These datasets were downloaded from http://konect.cc/. Table 3 lists the statistics of the selected networks. In the Bitcoin Alpha and Bitcoin OTC networks, edge weights lie in the range [−10, 10], whereas in the Advogato network all edge weights are positive. As the current study focuses on diffusion in weighted social networks, only the magnitude of the edge weight was taken into consideration and not the sign associated with it. To address the concerns associated with negative values, all edge weights were offset by adding a constant value of 10. This transformation preserves the relative differences between data points while ensuring that no value is negative.
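The weight offset described above amounts to a one-line transformation. The snippet below is a minimal sketch, assuming the edges are held as `(source, target, weight)` tuples. The function name `offset_weights` and this edge-list format are hypothetical and not taken from the experimental code.

```python
def offset_weights(edges, offset=10):
    """Shift every edge weight by a constant, e.g. ratings in [-10, 10] -> [0, 20]."""
    return [(u, v, w + offset) for u, v, w in edges]


# Hypothetical ratings: strongest distrust, strongest trust, and a mild rating.
ratings = [("a", "b", -10), ("a", "c", 10), ("b", "c", 3)]
shifted = offset_weights(ratings)
```

Differences between weights are unchanged by the shift (for example, 10 − 3 and 20 − 13 are both 7), while a rating of −10 maps to a weight of 0.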

Bitcoin Alpha:^58,59 A social network of people who trade Bitcoins on the Bitcoin Alpha platform. Nodes represent members of Bitcoin Alpha platform and an edge between two members denotes that the left member gave a trust rating to the right member. Edge weights represent the ratings given by the member on the left.

Bitcoin OTC:^58,59 A social network of people who trade Bitcoins on the Bitcoin OTC platform. Nodes represent members of Bitcoin OTC platform and an edge between two members denotes that the left member gave a trust rating to the right member. Edge weights represent the ratings given by the member on the left.

Advogato:^59,60 A social network representing an online community for developers. Each node denotes a developer and the directed edges represent trust relationships. Edge weights represent the ratings given by the left developer to the right developer.

Table 3.
Statistics of the three real-world social networks.*

Dataset	$| N |$	$| E |$	$k_{max}$	$⟨ k ⟩$	$δ$	$⟨ CC ⟩$
Bitcoin Alpha	3783	24,186	888	12.786	10	0.078
Bitcoin OTC	5881	35,592	1298	12.104	9	0.059
Advogato	6541	51,127	943	15.632	9	0.092

* $| N |$ = number of nodes, $| E |$ = number of edges, $⟨ k ⟩$ = average degree, $k_{max}$ = maximum degree, $δ$ = network diameter, $⟨ CC ⟩$ = average clustering coefficient

Table 4.

Influence spread attained by seed nodes in Bitcoin Alpha Network ( $k = 50$ ).*

Cascade	Model	DC	BC	CC	EC	PR	HC	$k$ C
1	SIR	366.50 ± 22.13	362.95 ± 29.90	316.12 ± 26.95	327.88 ± 26.21	376.66 ± 27.47	286.90 ± 23.49	339.36 ± 26.17
	IC-W	1268.69 ± 18.71	1325.51 ± 18.56	1114.49 ± 18.60	1127.51 ± 15.49	1298.87 ± 18.48	996.14 ± 17.19	1170.02 ± 19.68
	LiSt-D	1747.08 ± 13.30	1809.73 ± 14.49	1515.78 ± 16.40	1512.77 ± 16.39	1789.82 ± 16.55	1409.34 ± 15.10	1553.30 ± 16.01
2	SIR	625.96 ± 43.50	614.46 ± 48.79	536.15 ± 44.18	561.33 ± 46.21	638.85 ± 42.36	487.34 ± 43.07	579.25 ± 43.22
	IC-W	1628.62 ± 42.82	1680.92 ± 41.21	1440.39 ± 38.77	1462.77 ± 43.78	1654.34 ± 39.30	1319.48 ± 42.82	1512.50 ± 42.33
	LiSt-D	3140.60 ± 14.39	3181.97 ± 15.18	3112.58 ± 16.39	3080.03 ± 14.76	3160.22 ± 14.80	3087.15 ± 15.19	3063.56 ± 18.31
3	SIR	852.47 ± 58.61	837.53 ± 63.42	737.71 ± 61.91	771.12 ± 65.63	868.62 ± 54.97	670.01 ± 62.35	787.69 ± 61.79
	IC-W	1819.83 ± 54.37	1869.78 ± 58.11	1642.88 ± 54.98	1660.96 ± 58.11	1843.58 ± 63.21	1515.59 ± 65.87	1695.69 ± 58.63
	LiSt-D	3383.25 ± 12.39	3401.92 ± 11.93	3392.60 ± 11.10	3387.33 ± 10.16	3388.81 ± 11.25	3385.24 ± 12.88	3374.28 ± 12.64
4	SIR	1059.16 ± 70.75	1047.46 ± 80.02	930.56 ± 76.89	966.50 ± 78.25	1082.81 ± 65.78	842.22 ± 80.60	985.13 ± 78.12
	IC-W	1902.79 ± 71.05	1933.26 ± 59.45	1747.16 ± 70.10	1750.07 ± 71.85	1917.37 ± 60.71	1618.24 ± 90.87	1783.79 ± 72.14
	LiSt-D	3417.40 ± 11.76	3417.87 ± 10.02	3416.89 ± 11.41	3415.53 ± 10.60	3416.24 ± 11.46	3417.08 ± 10.97	3414.44 ± 10.73
5	SIR	1258.33 ± 80.43	1251.73 ± 93.29	1121.07 ± 92.91	1158.95 ± 90.07	1290.17 ± 79.60	1014.05 ± 99.66	1172.98 ± 90.25
	IC-W	1927.86 ± 65.50	2002.68 ± 67.07	1766.95 ± 84.21	1770.81 ± 75.70	1958.51 ± 63.69	1671.43 ± 91.33	1831.89 ± 73.27
	LiSt-D	3416.47 ± 11.21	3416.06 ± 11.72	3418.83 ± 10.82	3418.03 ± 10.73	3418.65 ± 9.14	3420.43 ± 10.11	3416.56 ± 9.70

* DC: Degree Centrality; BC: Betweenness Centrality; CC: Closeness Centrality; EC: Eigenvector Centrality; PR: PageRank Centrality; HC: H-index Centrality; $k$ C: $k$-core Centrality.

* Values averaged over 100 trials.

Table 5.

Influence spread attained by seed nodes in Bitcoin OTC Network ( $k = 50$ ).*

Cascade	Model	DC	BC	CC	EC	PR	HC	$k$ C
1	SIR	478.22 ± 35.07	493.22 ± 35.07	434.44 ± 33.03	451.83 ± 32.83	496.20 ± 35.14	315.22 ± 26.973	469.65 ± 37.77
	IC-W	2037.34 ± 21.68	2110.01 ± 24.71	1746.89 ± 20.88	1799.63 ± 21.48	1964.42 ± 20.05	1964.42 ± 20.05	1946.83 ± 22.57
	LiSt-D	2149.99 ± 21.15	2235.31 ± 20.42	1828.42 ± 21.19	1867.31 ± 20.34	2145.26 ± 19.59	1592.40 ± 16.57	2048.51 ± 16.11
2	SIR	822.89 ± 58.83	849.41 ± 54.71	752.42 ± 58.67	785.34 ± 56.73	861.69 ± 61.53	548.24 ± 46.97	812.76 ± 67.34
	IC-W	2577.64 ± 47.16	2663.66 ± 58.19	2269.29 ± 59.20	2322.15 ± 64.00	2488.02 ± 55.16	2488.02 ± 55.16	2473.61 ± 64.71
	LiSt-D	4401.54 ± 24.93	4440.84 ± 20.04	4288.06 ± 33.23	4254.63 ± 35.23	4364.03 ± 29.51	4213.28 ± 31.45	4298.77 ± 37.20
3	SIR	1132.98 ± 80.09	1160.91 ± 76.18	1035.05 ± 81.36	1083.28 ± 83.53	1177.08 ± 89.12	764.58 ± 71.49	1116.31 ± 91.44
	IC-W	2877.07 ± 76.36	2947.48 ± 71.43	2574.91 ± 94.85	2617.90 ± 91.20	2816.39 ± 87.19	2816.39 ± 87.19	2784.52 ± 81.54
	LiSt-D	4840.36 ± 18.17	4844.84 ± 15.04	4836.74 ± 16.87	4823.74 ± 16.88	4843.05 ± 15.45	4842.72 ± 17.77	4829.97 ± 15.71
4	SIR	1418.82 ± 100.90	1445.24 ± 100.66	1299.43 ± 108.21	1360.82 ± 105.92	1475.06 ± 116.23	978.64 ± 89.04	1398.41 ± 112.13
	IC-W	2999.24 ± 97.77	3069.72 ± 90.09	2710.09 ± 114.39	2760.74 ± 117.47	2922.11 ± 99.66	2922.11 ± 99.66	2910.77 ± 94.09
	LiSt-D	4911.13 ± 15.63	4908.21 ± 14.78	4909.91 ± 16.34	4910.33 ± 14.74	4912.35 ± 15.99	4921.52 ± 13.42	4907.36 ± 15.60
5	SIR	1697.22 ± 120.14	1720.78 ± 123.90	1564.50 ± 128.73	1624.85 ± 122.75	1755.41 ± 138.02	1194.81 ± 114.59	1670.08 ± 132.97
	IC-W	3059.69 ± 86.83	3135.63 ± 105.07	2791.49 ± 123.64	2827.55 ± 122.68	3000.98 ± 113.32	3000.98 ± 113.32	2958.84 ± 108.51
	LiSt-D	4917.73 ± 16.71	4919.50 ± 14.59	4919.37 ± 14.31	4919.93 ± 16.00	4920.09 ± 13.31	4936.66 ± 14.13	4916.57 ± 17.45

* DC: Degree Centrality; BC: Betweenness Centrality; CC: Closeness Centrality; EC: Eigenvector Centrality; PR: PageRank Centrality; HC: H-index Centrality; $k$ C: $k$-core Centrality.

* Values averaged over 100 trials.

Table 6.

Influence spread attained by seed nodes in Advogato Network ( $k = 50$ ).*

Cascade	Model	DC	BC	CC	EC	PR	HC	$k$ C
1	SIR	234.08 ± 20.60	189.74 ± 16.62	238.67 ± 19.77	230.67 ± 16.66	234.24 ± 21.87	118.36 ± 14.97	232.58 ± 20.89
	IC-W	710.45 ± 19.14	754.88 ± 18.03	311.41 ± 11.76	221.37 ± 11.39	193.16 ± 8.66	153.21 ± 8.42	571.91 ± 17.03
	LiSt-D	1786.13 ± 13.86	2007.77 ± 15.85	1030.42 ± 9.36	968.10 ± 6.93	905.90 ± 5.36	766.61 ± 6.07	1621.25 ± 13.16
2	SIR	413.15 ± 33.92	328.36 ± 29.83	420.73 ± 33.60	406.33 ± 27.82	413.45 ± 36.37	186.30 ± 28.52	407.93 ± 34.96
	IC-W	1007.83 ± 41.10	1080.72 ± 39.41	452.17 ± 32.56	307.89 ± 26.43	263.08 ± 25.37	225.68 ± 33.77	862.53 ± 45.85
	LiSt-D	3721.63 ± 18.50	3809.12 ± 15.93	3000.45 ± 35.96	2984.48 ± 26.81	2778.18 ± 16.22	3084.55 ± 18.73	3648.91 ± 22.98
3	SIR	592.97 ± 53.55	463.15 ± 44.14	604.72 ± 44.16	579.74 ± 41.10	590.90 ± 52.23	258.50 ± 44.57	585.79 ± 51.56
	IC-W	1190.25 ± 59.99	1268.12 ± 60.98	537.85 ± 55.60	378.89 ± 57.62	309.25 ± 51.82	292.26 ± 60.14	1041.73 ± 63.58
	LiSt-D	4124.59 ± 8.60	4127.61 ± 7.90	4052.76 ± 11.54	4044.05 ± 11.75	4004.99 ± 13.47	4057.21 ± 9.70	4121.64 ± 8.21
4	SIR	772.54 ± 69.09	603.48 ± 59.71	788.57 ± 59.49	755.68 ± 55.76	772.03 ± 68.65	335.14 ± 64.31	764.02 ± 65.39
	IC-W	1283.44 ± 66.39	1350.56 ± 69.95	613.49 ± 81.35	422.96 ± 68.22	335.57 ± 54.50	335.76 ± 72.48	1133.82 ± 69.25
	LiSt-D	4145.42 ± 8.31	4147.10 ± 8.23	4139.63 ± 8.36	4141.57 ± 7.62	4139.13 ± 7.44	4142.92 ± 7.69	4146.27 ± 7.70
5	SIR	952.51 ± 85.76	748.48 ± 73.81	976.01 ± 72.77	934.81 ± 71.55	957.35 ± 83.27	414.78 ± 84.07	944.79 ± 79.54
	IC-W	1329.05 ± 81.43	1402.20 ± 78.23	658.02 ± 100.40	453.15 ± 87.62	379.44 ± 77.42	381.67 ± 104.89	1170.13 ± 86.68
	LiSt-D	4146.82 ± 7.97	4146.23 ± 7.34	4145.38 ± 8.24	4147.04 ± 7.26	4145.77 ± 8.00	4148.33 ± 7.48	4146.01 ± 9.01

* DC: Degree Centrality; BC: Betweenness Centrality; CC: Closeness Centrality; EC: Eigenvector Centrality; PR: PageRank Centrality; HC: H-index Centrality; $k$ C: $k$-core Centrality.

* Values averaged over 100 trials.

Figure 1.

Average influence spread achieved for Bitcoin Alpha network.

Figure 2.

Average influence spread achieved for Bitcoin OTC network.

Figure 3.

Average influence spread achieved for Advogato network.

5.2 Findings and discussion

A study of the existing literature on weighted social networks showed that most research has focused on devising mechanisms for seed node identification rather than on developing diffusion models that accommodate edge weights, resulting in a lack of diffusion models specific to weighted social networks. Hence, in our work, we carry out diffusion in weighted social networks using the proposed LiSt-D model along with two popular diffusion models, namely SIR and IC-W, and study the findings. We chose SIR because most studies on IM in weighted social networks have been carried out using this model (see Table 1). IC-W was chosen because its working includes, to some extent, the notion of a weighted propagation probability. For each model, the parameters were set according to common practice in the subject area. In the SIR model, the infection rate (β) is set to 0.3 and the recovery rate (γ) to 0.1. In the IC-W model, the value of $p_{u, v}$ equals the reciprocal of the in-degree of node $v$. To simulate the diffusion process under the three models considered in this work, we used seven centrality measures, namely Degree Centrality (DC),⁶¹ Betweenness Centrality (BC),^61,62 Closeness Centrality (CC),⁶¹ Eigenvector Centrality (EC),^63,64 PageRank Centrality (PR),⁶⁵ H-index Centrality (HC),⁶⁶ and $k$-core Centrality ($k$ C),³² to generate initial seed sets of size $z = 50$ ($z$ denotes the count of seed nodes). The generated seed sets were provided as input to all three models. The choice of 50 seeds follows established practice in influence diffusion studies, where a seed set of this size is widely used as a benchmark. Using a fixed seed size across datasets ensures methodological consistency and enables fair comparison of diffusion outcomes across networks with differing sizes and topological characteristics.
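The parameter settings described above can be illustrated with a minimal sketch. The toy edge list, the function names, and the use of total (in plus out) degree as the degree centrality score are assumptions made for illustration only; they are not taken from the experimental code of this study.

```python
from collections import defaultdict

# Toy directed, weighted edge list (source, target, weight); illustrative only.
EDGES = [
    ("a", "b", 3), ("a", "c", 1), ("b", "c", 2),
    ("d", "c", 5), ("c", "a", 4), ("d", "a", 1),
]

# SIR settings as stated in the text.
SIR_PARAMS = {"beta": 0.3, "gamma": 0.1}


def in_degree(edges):
    """Count the incoming edges of every node that has at least one."""
    deg = defaultdict(int)
    for _, v, _ in edges:
        deg[v] += 1
    return dict(deg)


def icw_probabilities(edges):
    """IC-W setting: p(u, v) is the reciprocal of the in-degree of v."""
    deg = in_degree(edges)
    return {(u, v): 1.0 / deg[v] for u, v, _ in edges}


def top_z_by_degree(edges, z):
    """Pick z seeds by total degree (assumed definition of DC here)."""
    deg = defaultdict(int)
    for u, v, _ in edges:
        deg[u] += 1
        deg[v] += 1
    # Descending degree; ties broken by node label for determinism.
    return sorted(deg, key=lambda n: (-deg[n], n))[:z]


probabilities = icw_probabilities(EDGES)
seeds = top_z_by_degree(EDGES, z=2)
```

In this toy graph, node `c` has three incoming edges, so every edge into `c` receives probability 1/3 regardless of its weight, which is the sense in which IC-W captures weighted propagation only "to some extent".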

In this study, we simulate diffusion under the three models considered (the proposed LiSt-D, SIR, and IC-W) and record the experimental findings in terms of a common outcome measure, namely the influence spread achieved by a given seed set. Specifically, when diffusion is initiated with a seed set, the spread is computed as the total number of nodes that have become influenced by the end of the process. This provides a consistent basis for comparison across models, even though the underlying mechanisms differ.
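As a rough sketch of how such a spread value can be obtained, the following code runs an independent-cascade-style process for a fixed number of cascades and records the cumulative count of influenced nodes. The function name, the data layout, and the decision to count the seeds themselves as influenced are assumptions for illustration; the per-edge probabilities are supplied by the caller, so the same loop can serve any model that assigns one probability per edge.

```python
import random


def simulate_spread(out_edges, prob, seeds, cascades, rng):
    """Return the cumulative number of influenced nodes after each cascade.

    out_edges: dict mapping a node to the list of its successors
    prob:      dict mapping an edge (u, v) to its activation probability
    seeds:     iterable of initially influenced nodes
    """
    influenced = set(seeds)
    frontier = list(seeds)
    history = []
    for _ in range(cascades):
        newly_influenced = []
        for u in frontier:
            for v in out_edges.get(u, []):
                # Each newly influenced node gets one attempt per out-neighbour.
                if v not in influenced and rng.random() < prob[(u, v)]:
                    influenced.add(v)
                    newly_influenced.append(v)
        frontier = newly_influenced
        history.append(len(influenced))
    return history


# Deterministic toy case: a chain a -> b -> c -> d with certain activation.
chain = {"a": ["b"], "b": ["c"], "c": ["d"]}
certain = {("a", "b"): 1.0, ("b", "c"): 1.0, ("c", "d"): 1.0}
history = simulate_spread(chain, certain, ["a"], cascades=2, rng=random.Random(0))
```

With certain activation the chain gains exactly one node per cascade, so `history` is `[2, 3]`; the last entry is the spread reported for that run.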

It is important to emphasize that the methodological assumptions of the three models define distinct diffusion environments. The LiSt-D and IC-W models both incorporate heterogeneity in infection rates: in IC-W, propagation probability depends on the in-degree of the target node, whereas in LiSt-D, it varies with the link strength between connected nodes. In contrast, the SIR model employs a fixed infection rate, which can slow down diffusion relative to the other two. Further, while both LiSt-D and IC-W treat each node as either uninfluenced or influenced (with the set of influenced nodes monotonically increasing over time), the SIR model allows nodes to transition to a recovered state, thereby stopping their contribution to further spread. This difference naturally results in different growth trajectories of influenced nodes across the three models.
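The role of the recovered state can be made concrete with a minimal discrete-time SIR step. This is a sketch under assumptions: the synchronous update rule, the state labels, and the helper that counts infected plus recovered nodes as "influenced" are illustrative choices, not a description of the simulation code used in this study.

```python
import random


def sir_step(out_edges, state, beta, gamma, rng):
    """Advance a discrete-time SIR process by one step.

    state maps each node to "S" (susceptible), "I" (infected) or
    "R" (recovered). Only nodes infected at the start of the step spread.
    """
    new_state = dict(state)
    for u, status in state.items():
        if status != "I":
            continue
        for v in out_edges.get(u, []):
            # Fixed infection rate beta on every edge.
            if state[v] == "S" and rng.random() < beta:
                new_state[v] = "I"
        # A recovered node no longer contributes to the spread.
        if rng.random() < gamma:
            new_state[u] = "R"
    return new_state


def ever_infected(state):
    """Assumed spread measure for SIR: nodes that are infected or recovered."""
    return sum(1 for s in state.values() if s in ("I", "R"))


# Deterministic toy case: beta = gamma = 1 on a chain a -> b -> c.
chain = {"a": ["b"], "b": ["c"]}
start = {"a": "I", "b": "S", "c": "S"}
rng = random.Random(0)
after_one = sir_step(chain, start, beta=1.0, gamma=1.0, rng=rng)
after_two = sir_step(chain, after_one, beta=1.0, gamma=1.0, rng=rng)
```

In contrast to the monotone models, a node here leaves the spreading set once it recovers, which is why SIR trajectories can lag behind even when the infection rate is comparable.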

Nevertheless, by consistently measuring the final influence spread achieved for seed sets of equal size chosen using the same centrality measure, we establish a fair basis for reporting results and drawing relative insights. Tables 4–6 present the influence spread attained by the seven seed sets (50 seeds each) across the three real-world social networks studied, with the process simulated for five cascades under each diffusion model. A cascade refers to a discrete time step in the diffusion cycle. Each simulation was executed 100 times to reduce the impact of variation due to randomness, and the average value is reported in the tables in the format mean ± standard deviation.
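The reporting format can be sketched as follows. The helper names are hypothetical, and the use of the sample standard deviation (`statistics.stdev`) is an assumption, since the text does not state whether the sample or population form was used.

```python
import random
import statistics


def summarize(trial_fn, trials=100, seed=0):
    """Run trial_fn(rng) repeatedly and return (mean, standard deviation)."""
    rng = random.Random(seed)
    values = [trial_fn(rng) for _ in range(trials)]
    # Sample standard deviation; an assumption, see the lead-in.
    return statistics.mean(values), statistics.stdev(values)


def format_mean_sd(mean, sd):
    """Render a result in the 'mean ± standard deviation' style of the tables."""
    return f"{mean:.2f} ± {sd:.2f}"


# Toy trial: a stand-in for one diffusion run that returns a spread value.
def toy_trial(rng):
    return 100 + rng.randint(-5, 5)


mean_spread, sd_spread = summarize(toy_trial, trials=100, seed=42)
cell = format_mean_sd(mean_spread, sd_spread)
```

Passing a seeded generator into each trial keeps the 100 runs reproducible while still varying from trial to trial.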

The SIR model demonstrates gradual, progressive growth. It shows a steady and consistent increase in spread values across all centrality measures over successive cascades. In cascade 1, the model starts with relatively low spread values, indicating a modest baseline. Each subsequent cascade then adds a marked increase, so that by cascade 5 the values across all centrality measures are significantly higher than in the first cascade. The IC-W model demonstrates moderate initial performance with consistent gains. It also displays a positive growth pattern, though it starts from a higher baseline than the SIR model. Each successive cascade results in an increase across all centrality measures, although from the third cascade onwards the gains per cascade are generally smaller than those of SIR.

The LiSt-D model demonstrates high initial performance with an early plateau. It starts off with high values across all centrality measures, and these values continue to increase until the third cascade. Thereafter, LiSt-D appears to reach a plateau: the values in cascades 4 and 5 are almost static and near-maximum. This suggests that diffusion under LiSt-D saturates relatively early and then stabilizes. In general, all three models show positive trends over time, with the spread growing from one cascade to the next, though the magnitude and timing of the growth differ. The SIR model is characterized by strong upward growth, IC-W by stable and steady progression, and LiSt-D by early peak performance with high stability. The average influence spread attained by the selected seed sets is presented in Figures 1, 2 and 3 for the Bitcoin Alpha, Bitcoin OTC and Advogato networks, respectively. The vertical axis of the figures represents the influence spread, whilst the horizontal axis represents the seed selection measures.

Figures 4, 5, and 6 show the diffusion progression in terms of the percentage of the network influenced under the three diffusion models when the diffusion process is initiated using the seed nodes identified through DC. It can be observed that, from cascade 3 onwards, the percentage of the network influenced has almost stabilized under the LiSt-D model in all three networks under consideration. For the IC-W model, stability is visible approximately from cascade 4 onwards. Under the proposed LiSt-D model, 90% of the Bitcoin Alpha network, 84% of the Bitcoin OTC network, and 63% of the Advogato network are influenced by the selected seed nodes in five cascades, whereas under the IC-W model the same seed nodes cover only 51%, 52%, and 20% of these networks, respectively. Under the SIR model, the cascade 5 DC values in Tables 4–6 correspond to roughly 33%, 29%, and 15% of the Bitcoin Alpha, Bitcoin OTC, and Advogato networks, respectively.
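The coverage percentages can be checked directly from the reported data. The sketch below divides the cascade 5 spread values for DC seeds (Tables 4–6) by the node counts $|N|$ given in Table 3; the variable and function names are illustrative.

```python
# Node counts |N| from Table 3.
NODES = {"Bitcoin Alpha": 3783, "Bitcoin OTC": 5881, "Advogato": 6541}

# Mean influence spread at cascade 5 for DC seed nodes (Tables 4-6).
SPREAD_C5_DC = {
    "Bitcoin Alpha": {"SIR": 1258.33, "IC-W": 1927.86, "LiSt-D": 3416.47},
    "Bitcoin OTC": {"SIR": 1697.22, "IC-W": 3059.69, "LiSt-D": 4917.73},
    "Advogato": {"SIR": 952.51, "IC-W": 1329.05, "LiSt-D": 4146.82},
}


def coverage_percent(spread, n_nodes):
    """Share of the network influenced, as a percentage."""
    return 100.0 * spread / n_nodes


coverage = {
    network: {
        model: round(coverage_percent(spread, NODES[network]))
        for model, spread in by_model.items()
    }
    for network, by_model in SPREAD_C5_DC.items()
}

for network, by_model in coverage.items():
    print(network, by_model)
```

For example, 3416.47 / 3783 ≈ 0.903, which gives the 90% coverage reported for LiSt-D on Bitcoin Alpha.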

Figure 4.

Diffusion trend for Bitcoin Alpha network using DC seed nodes.

Figure 5.

Diffusion trend for Bitcoin OTC network using DC seed nodes.

Figure 6.

Diffusion trend for Advogato network using DC seed nodes.

6 Conclusion

Diffusion models play a crucial role in understanding how influence spreads across a social network by defining the conditions under which influence is transferred from one node to another. Many existing studies focus on a simplified binary view of relationships, i.e., the presence or absence of an edge between two actors. However, most real-world social networks are weighted, with link strength indicating how much one person can influence another. This variation in connection strength is key to understanding how influence spreads and should be considered in diffusion studies. Therefore, diffusion models that account for link strength (depicted by edge weights) are necessary to capture the nuances of real-world social influence and to offer more realistic predictions of influence dynamics.

In our research, we have proposed the Link Strength Diffusion (LiSt-D) model for influence diffusion in (directed) weighted social networks. The speed at which information or influence spreads in a network is heavily affected by relationship strength, and the proposed model accounts for these variations when calculating the diffusion probability between connected users. In this study, the diffusion process under the proposed LiSt-D model is simulated on three real-world weighted social networks, namely Bitcoin Alpha, Bitcoin OTC and Advogato, in which edge weights represent the ratings given by the source node to the target node. Seed nodes for initiating the diffusion process are selected using seven widely used seed selection measures. We conducted experiments to study the diffusion process under three diffusion models: the proposed LiSt-D, SIR, and IC-W. Experimental findings were recorded in terms of the influence spread achieved by the seed nodes. In general, all three models show positive trends over time, with the spread growing from one cascade to the next, though the magnitude and timing of the growth differ. The proposed LiSt-D model is characterized by early peak performance with high stability, the SIR model demonstrates strong upward growth, and IC-W shows stable and steady progression. In the LiSt-D model, the propagation probability varies with the link strength between a connected pair of nodes. This heterogeneity in the infection rate across links may result in some propagation probabilities being higher, so influence may spread at a faster pace than when the propagation probability is constant over all links. The findings showed that diffusion spread under LiSt-D covered 90% of the Bitcoin Alpha network, 84% of Bitcoin OTC, and 63% of the Advogato network.

One of the key limitations of the presented work is that the real-world data used relies on historical user data (e.g., past interactions) and does not account for the dynamic nature of user behaviour. User behaviour can change over time, so the patterns of information spread observed at one time may not hold in the future. Like most work in this subject area, the presented work also assumes a fixed and simple network structure, whereas real-world networks are dynamic and complex, with varying connectivity and changing relationships between individuals over time. Furthermore, social media platforms, which are often used for data collection, may not provide a representative sample of the population, so network data are often incomplete, biased, or difficult to obtain. Privacy concerns also restrict the amount of data that can be gathered. These limitations can be addressed in future work, thereby enabling the proposed model to better capture real-world complexities. Future work can further augment the proposed model by incorporating the edge weight contribution from the receiving node's perspective. Additionally, multiple objectives such as the signed behaviour of users, interaction frequency, and other activity-based parameters can also be taken into consideration.

Footnotes

ORCID iDs

Megh Singhal

Bhawna Saxena

Author contributions

Megh Singhal: methodology, experimentation, formal analysis, initial draft writing, review and editing.

Bhawna Saxena: conceptualization, methodology, review and editing, supervision.

Funding

The authors received no financial support for the research, authorship, and/or publication of this article.

Declaration of conflicting interest

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

References

1. Han. Influence analysis: a survey of the state-of-the-art. Math Found Comput 2018; 1: 201–253.
2. Zhang, Huang. Social influence analysis: models, methods, and evaluation. Engineering 2018; 4: 40–46.
3. Peng, Wang, Xie. Social influence analysis in social networking big data: opportunities and challenges. IEEE Netw 2016; 31: 11–17.
4. Peng, Zhou, Cao, et al. Influence analysis in social networks: a survey. J Netw Comput Appl 2018; 106: 17–32.
5. Chakraborty, et al. Polarity related influence maximization in signed social networks. PLoS One 2014; 9: e102199.
6. Shen, Nishide, Piumarta, et al. Influence maximization in signed social networks. In: Web Information Systems Engineering–WISE 2015: 16th International Conference, Part I 16, 2015, pp.399–414. doi: 10.1007/978-3-319-26190-4_27
7. Liang, Shen, et al. Influence maximization in signed social networks with opinion formation. IEEE Access 2019; 7: 68837–68852.
8. Zareie, Sakellariou. Influence maximization in social networks: a survey of behaviour-aware methods. Soc Netw Anal Min 2023; 13: 78.
9. Brown, Reingen. Social ties and word-of-mouth referral behavior. J Consum Res 1987; 14: 350–362.
10. Goldenberg, Libai, Muller. Talk of the network: a complex systems look at the underlying process of word-of-mouth. Mark Lett 2001; 12: 211–223.
11. Domingos, Richardson. Mining the network value of customers. In: Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, 2001, pp.57–66. doi: 10.1145/502512.502525
12. Richardson, Domingos. Mining knowledge-sharing sites for viral marketing. In: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, 2002, pp.61–70. doi: 10.1145/775047.775057
13. Kempe, Kleinberg, Tardos. Maximizing the spread of influence through a social network. In: Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, 2003, pp.137–146. doi: 10.1145/956750.956769
14. Hosseini-Pozveh, Zamanifar, Naghsh-Nilchi. Assessing information diffusion models for influence maximization in signed social networks. Expert Syst Appl 2019; 119: 476–490.
15. Wang, Gao, et al. A survey on information diffusion in online social networks: models and methods. Information 2017; 8: 18.
16. Fan, Wang, et al. Influence maximization on social graphs: a survey. IEEE Trans Knowl Data Eng 2018; 30: 1852–1872.
17. Saxena, Anand, et al. A Hurst-based diffusion model using time series characteristics for influence maximization in social networks. Expert Syst 2023; 40: e13375.
18. Newman. Spread of epidemic disease on networks. Phys Rev E 2002; 66: 016128.
19. Singhal, Saxena. Exploring the performance of diffusion models in weighted social networks. In: Proceedings of 2024 2nd International Conference on Disruptive Technologies (ICDT), 2024, pp.920–923. doi: 10.1109/ICDT61202.2024.10489068
20. Newman MEJ. Analysis of weighted networks. Phys Rev E - Stat Nonlinear, Soft Matter Phys 2004; 70: 056131–9.
21. Newman. Networks. New York: Oxford University Press, 2018.
22. Bellingeri, Bevacqua, Sartori, et al. Considering weights in real social networks: a review. Front Phys 2023; 11: 1152243.
23. Granovetter. The strength of weak ties. AJS 1973; 78: 1360–1380.
24. Sun, Tang. A survey of models and algorithms for social influence analysis. In: Aggarwal (eds) Social network data analytics. Boston, MA: Springer, 2011, pp.177–214. doi: 10.1007/978-1-4419-8462-3_7
25. Ren, Meng, et al. A large-scale group decision making model based on trust relationship and social network updating. CMES-Comput Model Eng Sci 2024; 138: 429–458.
26. Kridera, Kanavos. Exploring trust dynamics in online social networks: a social network analysis perspective. Math Comput Appl 2024; 29: 37.
27. Metcalf, Casey. Chapter 5 - graph theory. In: Metcalf, Casey (eds) Cybersecurity and applied mathematics. Cambridge, MA: Syngress, 2016, pp.67–94.
28. Jain, Katarya, Sachdeva. Opinion leaders for information diffusion using graph neural network in online social networks. ACM Trans Web 2023; 17: 1–37.
29. Mehta, Mishra. Trust exploitation in graph based social recommender systems: a survey. In: 2024 Second International Conference on Emerging Trends in Information Technology and Engineering (ICETITE), IEEE, 2024, pp.1–9.
30. Lezhnina, Kalinina, Karpenko, et al. Strategy for maximizing information influence in social networks. In: AIP Conference Proceedings (Vol. 3094, No. 1), AIP Publishing, 2024.
31. Hajarathaiah, Enduri, Anamalamudi, et al. Algorithms for finding influential people with mixed centrality in social networks. Arab J Sci Eng 2023; 48: 10417–10428.
32. Pattanayak, Saxena, Sinha. Influence maximization in social networks using community-diversified seed selection. J Complex Netw 2024; 12: cnae008.
33. Ahajjam, Badir. Identification of influential spreaders in complex networks using HybridRank algorithm. Sci Rep 2018; 8: 11932.
34. Liu, Zheng. Identifying important nodes in complex networks based on extended degree and E-shell hierarchy decomposition. Sci Rep 2023; 13: 3197.
35. Qiu, Zhang, Tian. Ranking influential nodes in complex networks based on local and global structures. Appl Intell 2021; 51: 4394–4407.
36. Kianian, Rostamnia. An efficient path-based approach for influence maximization in social networks. Expert Syst Appl 2021; 167: 114168.
37. et al. A novel approach to online social influence maximization. Soc Netw Anal Min 2014; 4: 53.
38. Aghaee, Kianian. Influence maximization algorithm based on reducing search space in the social networks. SN Appl Sci 2020; 2: 2067.
39. Shi, et al. Human-driven dynamic community influence maximization in social media data streams. IEEE Access 2020; 8: 162238–162251.
40. Chen, Zhao, Liu, et al. Efficient similarity-aware influence maximization in geo-social network. IEEE Trans Knowl Data Eng 2020; 34: 4767–4780.
41. Qin, Zhong, Yang. An influence maximization algorithm based on community-topic features for dynamic social networks. IEEE Trans Netw Sci Eng 2022; 9: 608–621.
42. Zhang, Kan. Influence maximization based on snapshot prediction in dynamic online social networks. Mathematics 2022; 10: 1341.
43. Saxena, Kumar. A node activity and connectivity-based model for influence maximization in social networks. Soc Netw Anal Min 2019; 9: 40.
44. Saxena. Towards establishing the effect of self-similarity on influence maximization in online social networks. Soc Netw Anal Min 2020; 10: 35.
45. Kumar, Panda. Identifying influential nodes in weighted complex networks using an improved WVoteRank approach. Appl Intell 2022; 52: 1838–1852.
46. Zhang, Chen, Dong, et al. Identifying a set of influential spreaders in complex networks. Sci Rep 2016; 6: 27823.
47. Raamakirtinan, Livingston LMJ. Identifying influential spreaders in complex networks by weighted vote ranking and hybrid methods. J Theor Appl Inform Technol 2021; 99: 1642–1661.
48. Raamakirtinan, Livingston LMJ. Identifying influential spreaders in complex networks based on weighted mixed degree decomposition method. Wirel Pers Commun 2022; 127: 2103–2119.
49. Yang, et al. Ranking the spreading influence of nodes in complex networks: an extended weighted degree centrality based on a remaining minimum degree decomposition. Phys Lett A 2018; 382: 2361–2371.
50. Kumar, Aggarwal, Panda. Identifying influential spreaders on a weighted network using HookeRank method. In: Krzhizhanovskaya, et al. (eds) Computational Science – ICCS 2020. Lecture Notes in Computer Science, vol 12137. Cham: Springer, 2020, pp.609–622. doi: 10.1007/978-3-030-50371-0_45
51. Carchiolo, Longheu, Malgeri, et al. Mutual influence of users credibility and news spreading in online social networks. Future Internet 2021; 13: 107.
52. Zhu, et al. Information spreading on weighted multiplex social network. Complexity 2019; 2019: 5920187.
53. Cantini, Marozzo, Mazza, et al. A weighted artificial bee colony algorithm for influence maximization. Online Social Networks and Media 2021; 26: 100167.
54. Namtirtha, Dutta. Weighted kshell degree neighborhood: a new method for identifying the influential spreaders from a variety of complex network connectivity structures. Expert Syst Appl 2020; 139: 112859.
55. Liu, Wei, et al. Identifying influential spreaders by weight degree centrality in complex networks. Chaos, Solitons & Fractals 2016; 86: –7.
56. Ren, Wang, Liu, et al. Identify influential spreaders in complex networks based on potential edge weights. Int J Innov Comput Inf Control 2016; 12: 581–590.
57. Maji. Influential spreaders identification in complex networks with potential edge weight based k-shell degree neighborhood method. J Comput Sci 2020; 39: 101055.
58. [dataset] Kumar, Spezzano, Subrahmanian, et al. Edge weight prediction in weighted signed networks. In: Proceedings of 2016 IEEE 16th International Conference on Data Mining (ICDM), 2016, pp.221–230. doi: 10.1109/ICDM.2016.0033
59. [dataset] Kunegis. Konect: the Koblenz network collection. In: Proceedings of the 22nd international conference on world wide web, 2013, pp.1343–1350. doi: 10.1145/2487788.2488173
60. [dataset] Massa, Salvetti, Tomasoni. Bowling alone and trust decline in social network sites. In: Proceedings of 2009 Eighth IEEE International Conference on Dependable, Autonomic and Secure Computing, 2009, pp.658–663. doi: 10.1109/DASC.2009.130
61. Freeman. Centrality in social networks: conceptual clarification. Social Network: Critical Concepts in Sociology 2002; 1: 238–263.
62. Brandes. A faster algorithm for betweenness centrality. J Math Sociol 2001; 25: 163–177.
63. Ruhnau. Eigenvector-centrality – a node-centrality? Soc Networks 2000; 22: 357–365.
64. Spizzirri. Justification and application of eigenvector centrality. Algebra in Geography: Eigenvectors of Network 2011.
65. Page, Brin, Motwani, et al. The PageRank Citation Ranking: Bringing Order to the Web. In: The Web Conference, 1999.
66. Campiteli, Holanda, Soles, et al. Hirsch index as a network centrality measure. 2010. doi: 10.48550/arXiv.1005.4803

Reference	Research Contribution: Seed Selection Algorithm	Research Contribution: Diffusion Model	Connection Strength Considered	Existing Diffusion Model Used
⁵	IC-P Greedy	IC-P	No	IC, IC-W, TV
⁷	R-Greedy with Live-edge and Propagation-path	LT-S	No	IC-W, TV
⁴⁴	Improved Weighted Vote Rank	-	Yes	SIR
⁴⁶	Weighted Vote Ranking with Weighted Mixed Degree Decomposition	-	Yes	SIR
⁴⁷	Extended Weighted Mixed Degree Decomposition	-	Yes	SIR
⁴⁸	Extended Weighted Degree based on Remaining Minimum Degree Decomposition	-	Yes	SIR
⁴⁹	HookeRank	-	Yes	SIR
⁵⁰	-	-	No	SIR, IC, IC-W, TV
⁵³	Weighted Artificial Bee Colony	-	Yes	-
⁵⁴	Weighted $k$-shell Degree Neighborhood Indexing	-	No	SIR
⁵⁵	Ranking based on Tuning Weight Parameter	-	No	SIR
⁵⁶	Evidential $k$-shell Centrality based on Potential Edge Weight	-	Yes	SI
⁵⁷	Potential Edge Weight based $k$-shell Degree Neighborhood Centrality	-	Yes	SIR

Table 3.

Statistics of the three real-world social networks.*

Dataset	$|N|$	$|E|$	$k_{max}$	$⟨k⟩$	$δ$	$⟨CC⟩$
Bitcoin Alpha	3783	24,186	888	12.786	10	0.078
Bitcoin OTC	5881	35,592	1298	12.104	9	0.059
Advogato	6541	51,127	943	15.632	9	0.092