Abstract
Studying complex networks is essential for a better understanding of network science. Many studies have been done on single-layer networks in complex networks. After the advancement and widespread usage of the internet and social media networks, performing community detection in multilayer networks becomes essential to reach more people and work with different personalities on different platforms. Motivated by this observation, this paper has studied types of networks, metrics, measures, and community detection using deep learning-based models in multilayer networks. This survey can play a significant role in analyzing and understanding multilayer networks.
Introduction
For two decades, complex networks have become an essential concept in network science. Complex networks have been investigated in many systems such as power grids, the world wide web, biological networks, genetic networks, and social networks [4]. Social networks such as Facebook, Twitter, Linked In, We chat emerged as the best platforms for interaction between different people. Research topics like influence maximization, community detection, and link prediction play a crucial role in social networks. Traditionally complex networks are modeled as single-layer networks, in which nodes and edges are the same behavior and characteristics. However, many natural systems do not support single-layer networks ex: C.elegans nematode. Using the multiple-layer concept, we may overcome this kind of problem. Each person might have accounts and friendship links on different platforms, where friendship links in each platform are treated as a separate layer.
So far, research and literature study of complex networks performed in single-layer networks, where the users (nodes) and interactions (edges) are assumed to belong to the same network. Neglecting some relationships in complex networks may not capture the complete real-time world systems, leading to the complex networks’ incorrect dynamic properties. So, in very recent years, we are witnessing an unprecedented change in the study of complex networks, where interacting nodes belong to different platforms, i.e., layers, this kind of networks called multislice [29], multiplex [12], multilevel [40] and general multilayer networks [19]. A variety of real-world networks have interconnections with different kinds of interactions, ex: Users can have different types of social relationships such as kinship, friendship, and vicinity ship same cultural society forms multilayer public social networks.
Research in multilayer networks is at the infant stage. Minimal data available, the study of multilayer networks interconnects with different topics in social network analysis, including dynamics of networks, link prediction, influence maximization, measures, and metrics of networks, to quantify the behavior of animals. It is related to eco-evolutionary dynamics as connected layers of interactions [17] to model the transportation system. Therefore it is essential to investigate multilayer networks in large-scale complex networks under big global data. As a result, we aim to present a clear picture of multilayer networks.
In recent years, different kinds of measures and metrics have been defined differently from the perspective of multilayer networks. These measures and metrics, such as degree centrality, node overlapping, edge overlapping, page rank centrality, clustering coefficient, and eigenvector centrality, have been extended to identify communities, influential nodes, and link prediction in multilayer networks. For a long time, not only have sociologists realized that it is essential to study the social relation set of nodes based on relation type, but anthologists researchers also proved the importance of diverse social network relation types [41]. So community detection in multilayer networks has also become an important topic in discussions. Similarly, spreading the data in different networks through overlapping nodes in the different layers using information diffusion models also sparked many researchers. We need synthetic networks and real-time data sets to run the implemented algorithms. Many synthetic network packages have been implemented in multilayer networks. We discussed some of them as well as some real-time data sets.
The dynamics of the spreading process in multilayer networks are influenced by many properties like interconnected networks and multiplex networks [33]. The spreading process of interconnected networks studies the impact of inter-layer patterns and interaction strength between layers. Interaction strength between layers depends on some measures; they are the second nearest neighbor, interconnection topology measure, and inter-layer link density. The impact of the inter-layer pattern (i.e., how nodes are connected in different layers) in interconnected network measures based on inter clustering coefficient and inter-degree-degree correlation. Various topics on the spreading process have been discussed recently in multiplex networks. Intra-layer structure, layer similarity, partially interconnected multiplex networks, layer switching cost, are discussed. The properties of the spreading process are discussed, the role of network structure and spreading velocity, interacting spreading process, diffusion of innovations, and resource constraints. Jiexin et al [42] presents the traffic dynamics on multilayer networks. Authors in [42] explains performance indicators such as network capacity, average transmission time etc. They also discusses optimization strategies for improving transmission performance. Walter et al. [14] propose multilayer quality and grade of service support for high speed networks. It proposes a framework for multilayer QoS and GoS support in GMPLS based IP/WDM networks.
Community detection in multilayer networks is one of the research areas with increasing practical significance. Lancichinetti et al. [22] proposed two problems. One is hierarchical community structure; large communities are composed of small communities. The second is overlapping communities when a node is in more than one community like family, friends, professional, etc. Above mentioned traditional communities are applied to small networks and in simple cases. However, real-world networks make traditional methods less applicable in complex and multilayer networks. The deep learning techniques offer optimistic solutions in community detection in multilayer networks. Therefore community detection using deep learning techniques is a new trend, especially in multilayer networks.
To the best of our knowledge, this is the first attempt to study Community detection, including deep learning techniques in multilayer networks. The significant contribution of the proposed work is given as follows.
We have done a study on various networks models of multilayer networks.
We provide a detailed study of measures and metrics used in multilayer networks.
We provide a brief study of community detection, including deep learning techniques in multilayer networks.
Applications of multilayer networks introduced and as well as future scope of the multilayer networks.
This section provides a detailed introduction of multilayer networks, the rest of the paper organized in the following way. In Section 2, we discussed the network model, i.e., multilayer networks and multiplex networks. Section 3 presents metrics and measures of multilayer networks and aggregating the many networks into a single network, i.e., coupling networks. Section 4 briefed on dynamics of multilayer networks; it consists of a broad range of research in the domain of community detection. In Section 5, we presented the applications of the multilayer networks. Finally, Section 6 presents about Conclusion and Future Work.
Network model
The Graph consists of many networks, and each network treats as a layer. Many authors proposed many network models for the definition of multilayer networks. We discussed two main models as multiplex networks and multilayer network models. Most of the authors aggregate all the layers in a multilayer network into a single network. Given a Multilayer Graph
Multiplex network
Multiplex is a particular type of multilayer network. Each layer has an exact number of nodes in a multiplex network, and possible interlayer connections exist between replica nodes. Intralayer connections can exists between any of two nodes in the same layer. Representation of M-layer Multiplex network is
Random walks are an easy way to explore the multiplex networks [7]. Different kind of biased random walks has been studied. Degree-biased random walks are used in many cases like finding centrality measures, community detection, influence maximization algorithms. Since many interconnected layers require many biased random walks, mainly divided into intensive and extensive walks.

(A) Multiplex network. (B) Multilayer network.
A multilayer network model is intra and interconnected graph network
In a multilayer network, each layer may not have the same number of nodes; nodes also may be different. In general a node u in layer X can be connected to node v in any layer Y [1]. Layers will represent the node’s character, behavior, and aspect and links in that layer.
If
Given a graph
Any two nodes can be connected in the network, i.e., nodes in the same layer or nodes in different layers. Most of the study has been performed in multiplex networks, and study in multilayer networks is minimal because most of the reference papers are in multiplex networks.
Metrics and measures of multilayer networks
Traditional measures and metrics in a single layer are also defined in multilayer networks, but some metrics are defined differently in multilayer networks. This section discuss the different metrics in multiplex networks, including edge overlap, clustering coefficient, degree centrality, page rank centrality, eigenvector centrality.
Battiston et al. [6], introduced basic metrics are, i.e., node overlap and edge overlap in the multiplex network. Edge overlap is defined as there exists an edge between two nodes in the same layer occurs in different layers, then edge overlap of an edge
Where
Node i of the degree in layer α is
The clustering coefficient in a monoplex network is the ratio of the number of links between the neighbors of a node to the maximum possible number of links between neighbors.
Degree centrality is a measure to identify the neighbors of a node v, degree centrality (DC) of node v is
Degree centrality of a node
Singular Vector of Tensor (SVT) centrality [38] used to quantitatively find the importance of nodes connected by different kind of connections in multilayer networks.
Page rank centrality [37] is a measure to give ranking to the nodes based on incoming edges to particular node. Page rank
In a single-layer network, the eigenvector centrality of node i is defined as the
To measure the impact of layers interrelation on information spreading process in between two layers correlation coefficient (CrC) [43] can be used, the CrC for two layer network is:
Pearson correlation coefficient (PCC) [28] is a measure to find the linear correlation between two nodes. PCC is used to find overlap between three layers estimated from pairwise PCC in multilayer networks. Pearson correlation coefficient i.e
Cross-layer degree centrality (CLDC) [10] is defined as the sum of cross-layer in-degree centrality and cross-layer out-degree centrality.
Whereas cross-layer in-degree centrality is the ratio between the sum of the edge weights incoming to node a towards multilayer neighborhood
Whereas cross-layer out-degree centrality is the ratio between the sum of the edge weights that are outgoing from node b towards multilayer neighborhood
Here
Coupling of the networks
Interlayer connections between replica nodes in multilayer networks happen through coupling; Murata et al. [31] discussed two types of couplings, i.e., ordinal and categorical. In ordinal coupling, the inter-layer edges exist only between replica nodes adjacent layers. In categorical coupling, the inter-layer edges exist between replica nodes in all the layers.
Coupling the nodes in the network happens via lossy or lossless coupling schemes [44]. In a lossless coupling scheme, the resultant network will maintain all the features and quality of the original network without any loss. The lossy coupling scheme offers an alternative solution that may compromise quality, and the resultant network may also lose some features. Generally, lossy coupling uses when memory and time constraints exist. Different lossless coupling schemes use to couple the multiplex into a single layer network; they are clique lossless coupling scheme, star lossless coupling scheme, reduced lossless schemes, and extension to another diffusion model.
Community detection in multilayer networks
Complex networks are inherently organized as communities, i.e., clusters or modules with sparse external links and dense internal links. Since communities have common properties, community structure helps better understand and overall function of the network. Community detection has been studied extensively in single-layer networks for many decades. Researchers are working on community detection in multilayer networks (MLCD). Practical approaches for finding MLCD can be divided into three main categories, first is direct methods, second is flattening methods, and third is aggregating methods [35]. Indirect methods directly identify the multilayer networks as inputs by optimizing some multilayer quality assessment criteria. In flattening methods, the single-layer will be detected on multilayer networks; after the detection of single layer network, apply conventional community algorithm to detect communities. In aggregation methods, community structure identifies on each network layer separately; after that aggregation, the mechanism will be used to identify the final community structure.
Xing Su et al. [34] proposed five deep learning categories for community detection. They are Conventional networks, Generative Adversarial Network (GAN), Graph Attenuation Network (GAtN), Deep Sparse filtering, Auto Encoder. GAtN is crucial at special attention to community signals. The conventional network consists of conventional neural networks and graph conventional networks (GCN). Auto encoder has subcategories; it consists of noise auto encoder, graph attention auto encoder, and variational auto encoder.
Cao et al. [11] proposed deep neural networks for graph representation (DNGR), it applies denoise encoder to increase the robustness for capturing structural data when detecting communities in multilayer network. Li et al. [23] proposed clustering oriented network using deep networks and it is extension to DNGR to connect the cluster assignments. Mehta et al. [27] proposed overlap stochastic block model is a fast recognition model that enables the fast inference of the node to detect the communities across the network.
Researchers use modularity measures to find the strength of community; the modularity concept uses for several optimization methods to discover communities in the network, including spectral division, external optimization, greedy agglomeration, and simulated annealing. Ma et al. [26] proposed a new algorithm semi-supervised joint non-negative matrix factorization (S2-JNMF) to find communities in multi-layer networks. S2-JNMF consists of three stages, one is priori information construction, the second is non-negative matrix factorization, and the third is community detection. Priori information has two steps; in the first step, the greedy step strategy has been used to identify dense subgraphs in a multi-layer network. In the second step, the strategy starts at one node called seed node, and it keeps expanding the subgraph until it finds the density of the subgraph in some layers of the multi-layer network is less than the predefined threshold. In non-negative matrix factorization, based on subgraph, construct the semi-supervised matrix. In community detection, obtain the community using a matrix. The time complexity of the algorithm is
Pramanik et al. [32] proposed new modularity index called multilayer modularity index (
With the complexity of multi-layer networks, it has become more complicated to get the actual data of the network. Even though we get the global structured data, computation task might be much high, it may not be possible in real-time networks, so to overcome this problem, XiaoMingLi et al. [25] proposed a new community detection algorithm called multi-layer social network based on a local random walk (MRLCD), it has two stages one finds the core node, core node determines based on repeatability of the node in the multi-layer network. The second is the core clustering stage based on a local random walk. In the random walk, find the scope of the node based on intra and interlayers trust.
Interdonato et al. [18] proposed local community detection in multi layer networks(ML-LCD) method takes multilayer network and seed node v as input. It computes the local community C is associated to v. Different measures have been used to compare the results in ML-LCD are ML-LCD-lw on layer weighting, ML-LCD-wlsim on with in layer similarity and ML-LCD-clsim on cross layer similarity, Ml-LCD-lw and ML-LCD-wlsim is
ML-LCD-clism is
Brodka et al. [9] proposed a new measure called cross-layer edge clustering coefficient (CLECC), which was developed based on the idea of edge clustering coefficient. Calculate CLECC for all the possible pairs of x, y in layer α. Then remove all low coefficient edges and recalculate them. The process repeats until subgraphs or isolated nodes occur. Subgraphs are communities in a multilayer network.
Kuncheva et al. [21] proposed an algorithm locally adaptive random transitions (LART) to find communities in the multiplex network, the algorithm based on the random walk in multiplex and transition probabilities. The random walk will perform within and across in multilayer networks; a node
Tang et al. [36] discussed to extracting structural features from each dimension of networks using modular analysis, then integrate all of them to find out community structure. Three kind of modularity analysis concepts has been discussed i.e. Average modularity maximization, total modularity maximization and principal modularity maximization. In average modualrity maximization, treat M-layer as 1- layer by aggregating the network and Average maximize modularity is
In total modularity maximization maximizes the total modularity in all the layers, it considers degree distribution in each dimension is different where as in AMM degree distribution is same. Total maximization modularity is
Where
In AMM and TMM, different dimensions have been aggregated into single-layer networks. However, in PMM, extract the structural features at each dimension then perform cross dimensional integration. Principal modularity maximization consists of two steps, one is structural feature extraction, and the second is cross-dimension integration. In structural feature extraction, extract the features from each dimension via modularity maximization; Then apply the principal component analysis (PCA) on the concatenated data to identify top eigenvectors as modularity matrix. After projecting the data onto the principal vectors, Compute low dimensional embedding with the help of the top eigenvector of modularity matrix and then perform k-means on the low embedding to find the possible communities. As part of the comparison, they compared AMM, TMM, and PMM with single-layer modular maximization. Performance seems to be like
Amelio et al. [3] proposed a new method multilayer genetic algorithm(MultiGA) to detect the community structure in multilayer networks. MultiGA adopts the genetic representations of individuals that allow cooperation and co-evolution among all the network layers. Each individual has n genes, after decoding the genes, evaluate community structures(CS). After identifying community structures, A new modularity concept has been introduced for evaluating fitness function by combining modularity values computed for each layer. Let a and b are layers of multilayer network,
Where
Similarly combined modularity of
The total combined modularity of
Mucha et al. [30] proposed multislice modularity for knowing the fitness of communities in multilayer networks. For quantifying communities in multilayer networks, quality functions have been used; quality function compares the number of intracommunity edge weight minus expected at random fails to provide any contribution from the interlayer coupling. Based on quality functions, multislice generalization of modularity has generated,
Where
In the above
Where
Jianxin et al. [24] proposed community diversified influence maximization (CDIM) to address the computational complexity in previous models. It uses the CPSP-tree index to find k influential users in the social network. Yuqing et al. [45] propose influential pricing nodes in the networks based on their expected influence spread. Authors in [45] design a function to characterize the relationship between price and expected influence.
Kumar et al. [13] propose IM using Extended h-index and Label Propagation with Relationship matrix (IM-ELPR). IM-EPLR has four phases, first is the seeding phase to select the candidate seed nodes, second is the label propagation phase to find the communities with the help of seed nodes. The third phase is to merge the communities using a relationship matrix, and the fourth phase is to select the k influential nodes using the rank measure. Xuanhao et al. [15] propose community-based influence maximization in Location-based social networks (LBSN), which comprises three steps. First step is to find the communities based on Spatio-temporal similarity measures between users. The second step finds the candidate nodes using the candidate selection algorithm. In the third step, it uses two community detection algorithms to find final seed nodes. Table 1 explains the different community detection methods and its functionalities.
Community detection algorithms in multilayer networks
Community detection algorithms in multilayer networks
In this section we discussed some applications of multilayer networks. we did not cover subject applications detail, but we kept more references, interested readers can go through easily [1].
Transportation system
There are many datasets on multiplex networks like EU air transport, London metro system or Six Continental airlines, etc. There can be several ways of transport mode like bus, rail or air, etc. Any of the transport modes can be used between any two locations, transport services follow different routes, stations/ locations are the nodes, and paths they follow are edges of the network. Alessio Cardillo et al. [13] studied the dynamics of the European air transport network, where each airline is a layer and airports are nodes of the multilayer network.
Multilayer networks in social networks
Multilayer networks in sociology uses in many places like marketing, product advertisements, virus spreading using Information diffusion models after finding influential nodes in different networks [5,16,20,39] and to find similar kind of people in networks [3,21,26,32]. Other than this, we can also use to find the future friendship links in different networks with help of prediction algorithms. We also discusses the biological networks here. Studying the biological elements and their interactions are crucial for knowing about biomolecules and biological process. Research in biological networks aims to organize the biological structures to desired states by manipulating signals. Zheng et al. [27] has proposed a framework for controlling nonlinear dynamical systems with the dynamic approach in multilayer networks and applied target identification of diseases. Similarly, some other applications. Shinde et al. [24] studied protein to protein interaction analysis in multilayer network analysis of different life stages in Caenorhabditis elegans, where each layer represents a life stage of bacteria. Zitnik et al. [45] discussed the prediction of multi cellular function through multilayer tissue network.
Conclusion and future work
This study provides a comprehensive overview of the state of the art measures, metrics and community detection in multilayer networks. In the last decade, most of the community detection studies done on single layer network. According to our study, these methods are efficient, effective and applicability of community detection in multilayer networks. As communities have more edge denisity, nodes in communites do have frequent interaction with other nodes in the community. This property will helps in increase in the information spread in the community as well as network. Therefore, community based models can give better performance than traditional algorithms.
In the future, we will explore influence maximization, community detection and link prediction in multilayer networks using different domains such as game theory, recommender systems, etc. To the best of our knowledge, few researchers works on influence maximization using community structures in multilayer networks.
Conflict of interest
None to report.
