GT-CHES: Graph transformation for classification in human evolutionary systems

Abstract

While increasingly complex algorithms are being developed for graph classification in highly-structured domains, such as image processing and climate forecasting, they often lead to over-fitting and inefficiency when applied to human interaction networks where the confluence of cooperation, conflict, and evolutionary pressures produces chaotic environments. We propose a graph transformation approach for efficient classification in chaotic human systems that is based on game theoretic, network theoretic, and chaos theoretic principles. Graph structural properties are compiled into time-series that are then transposed into the frequency domain to offer a dynamic view of the system for classification. We propose a set of benchmark data sets and show through experiments that the approach is efficient and appropriate for many dynamic networks in which agents both compete and cooperate, such as social media networks, stock markets, political campaigns, legislation, and geopolitical events.

Keywords

Graph classification graph time-series chaotic systems

1. Introduction

Human systems, consisting of agents and their interactions, tend to evolve chaotically due to natural selection, the blend of cooperation and conflict that such natural selection induces over time, and the maximal adaptability due to self-organization [52, 53]. The field of game theory has expanded considerable efforts in attempting to quantify solutions for human interactions. Solution concepts have been formulated for special types of interactions, such as pure cooperation [65] and pure conflict (also known as zero-sum games) [76]. However, for the general case, namely a mix of conflict and cooperation, there is no universal notion of optimal decision-making.

Unsurprisingly, most human interactions are mixed-motive, as well as non-reductionist, so that it is not possible to partition them into their structured and randomized subsets, solve each subset separately, and combine the results. Hence, understanding the dynamics of human (highly-connected) systems requires the synthesis of elements of network theory (network structure), game or interaction theory (strategic behavior), and non-linear system dynamics or chaos (feedback effects) [17]. We seek to improve the accuracy of classification in human systems by synthesizing solution concepts from these various theories.

In general, the system’s state cannot be adequately represented by a vector of measurements, nor indeed even estimates, of the properties of its constituent parts, as is typically done in most time-series analysis, whether it be the inputs to estimating auto-regressive embedding dimensions [73, 55, 71, 62] or the inputs to echo state architectures [30]. Hence, learning labels for the constituent parts of a system, as in node or link classification, is fundamentally different from learning labels for the system as a whole, as in graph classification.

As such, our proposed approach is to transform a graph time-series classification task into a standard graph classification task by encoding the time-series nature of a number of relevant graph structural properties through fast Fourier transforms. We then use a random forest algorithm to predict the label of each graph through time.

The rest of the paper is organized as follows. Section 2 gives a review of some of the most relevant related work. Section 3 describes our novel approach to graph classification in chaotic human systems. A number of experiments that demonstrate the validity and efficiency of the approach, as well as comparisons with competing methods inspired by state-of-the-art deep learning architectures, are described in Section 4, together with novel benchmark data sets. Finally, Section 5 concludes the paper.

2. Related work

Our focus is on classification tasks for complex, dynamic, chaotic, human systems. Lines of inquiry in classification learning have generally been restricted to one or two of these system characteristics. We review the most relevant ones here.

Many machine learning and statistical tasks usually are performed in domains with complex, yet highly structured data; thus, a stable data distribution can be assumed. Such is not the case for dynamic systems, where data shift, rather than stability, is the norm. The shift can be a product of an intractably complex system or due to the forecasts themselves in the case of interactive systems. Work has been done in the area of complex adaptive systems, where an explanatory model of the system rather than a black-box prediction are preferred. [18, 1], as well as in reinforcement learning and no-regret learning, especially in the context of autonomous robots where dynamic learning and predictions are needed [20].

Deep learning (DL) has recently allowed major advances in highly complex, yet rather structured and non-chaotic, human domains, such as natural language processing and image classification tasks. The key to moving from the early multi-layer perceptron to the modern state-of-the-art deep learning architectures is the back-propagation of error method [61, 42, 9], which allows for a priori knowledge to be built into the network architecture [41], especially in the form of convolutional layers [36] that essentially act as feature detectors [27, 40]. Further improvements have been made possible, especially in the context of natural language processing, through attention [75] and recurrence [28] mechanisms. Most recently, deep learning has been extended to graphs in the emergent field of graph neural networks (GNNs) [64]. Since all permutations of the orderings of nodes and edges produce isomorphic graphs, tabular representations of graphs for consumption by standard machine learning techniques have proven largely unsatisfactory. GNNs relax the notion of isomorphism in favor of smoother metrics associated with graph structure that can become data points for a standard vector representation [35]. The idea is to use convolution and pooling to extract these permutation-invariant properties [8, 79, 81, 43, 48, 80, 78], where convolution creates embeddings for each node through embeddings of their neighbors, and pooling essentially extracts information from subgraphs. Significant challenges faced by GNNs in their use for human systems include pooling assumptions aligned with unsigned graphs [16], loss of spectral properties for directed and signed graphs [8, 34], weights and signs combination for propagation pooling operators [15], staleness [59], and difficulties with dynamic graphs [8].

DL work is being extended to chaotic physical systems and some human systems. This field is rapidly growing as graphs are now taken as inputs to novel formulations of DL components (and sometimes the components being formulated as nodes of a graph to be learned [21]). We review a few here that represent the general direction taken, which involves composing recurrence and GNN architectures. Work on pollution often combines graph- and recurrence-based neural networks in order to capture both the spatial and temporal nature of the spread of PM_2.5 (Particular matter with diameter less than 2.5 $\mu$ m). The graph convolutional neural network long short-term memory (GCN-LSTM) introduced in [57] employs a GCN to extract spectral features from the input matrix. The spectral features are concatenated with the matrix to form the inputs to the LSTM. The Fourier coefficients that determine the spectral features are trained by backpropagation. Another approach to the same problem is the graph-based long short-term memory (GBLSTM) [21]. In this formulation, each forget gate of the LSTM represents an air quality monitoring station rather than an entire graph. The gates are connected by an adjacency weight matrix whose weights are learned through training. Hence, this novel approach treats DL components as a graph whose weights are learned. These methods have proven to be state-of-the-art. We note, however, that the recurrence mechanism is effective in capturing the spatial quality of the problem since the deterministic nature of physical systems is more structured than that of human systems. For human systems, a combined graph and DL approach is being taken for link and node prediction. The graph convolutional long short-term memory (GC-LSTM) proposed in [12] inputs a series of graphs into the LSTM forget gates that act as an encoder. The output of the forget gates goes into a fully connected layer that acts as a decoder, and outputs an estimation of the graph at the next time step. We see the use of attention mechanisms for link- and node-classification in temporal graphs [77, 82], however, these often lack a specific inductive bias – such as structural balance or node strategies. This reliance on universal approximators to sift out temporal behaviors in highly complex systems, rather than assuming a set of behaviors upfront, leads to overfitting the training data rather than generalization [41]. Other methods do search for behavioral motifs, such as triadic closure, for node and link prediction [50]. Such tasks do fall within the human evolutionary domain, but they concern less chaotic atomic-level predictions rather than system classification.

Reservoir computing (RC), on the other hand, has been successful at addressing chaotic, but less human-like tasks. Early efforts to predict a chaotic signal estimate the hidden state through a delayed embedding [73, 60]. This approach converts a temporal problem into a spatial one, but also introduces an artificial time horizon, an awkward representation of time, and many parameters [66]. Recurrent neural networks (RNNs) were introduced in conjunction with back-propagation [61], with an elegant theoretical justification as an Euler approximation of time-dependent ordinary differential equations [32]. However, it has been shown that RNNs cannot perform over an arbitrary time period due to the vanishing gradient problem, are not robust to noise, and lack efficiency [3]. The echo state networks (ESNs) and liquid state machines (LSMs) were introduced as a way to employ recurrence like RNNs and to convert the temporal problem to a spatial one, as Takens had done [73], but in a way similar to kernel methods and support vector machines [7]. The idea is that reservoirs of randomly connected networks can act as “a complex nonlinear dynamic filter that transforms the input signals using a high-dimensional temporal map” [66], giving rise to the notion of reservoir computing. The property that has enabled RC to be successful with short-term predictions of chaotic systems is the echo property [29], which informally states “that the current state of the network is uniquely determined by the network input up to now, and, in the limit, not by the initial state. The network is thus state forgetting.” [66] A significant difference between RC and DL is that DL is built on error backpropagation, while RC is built on a feed-forward architecture. In DL, weights are adjusted iteratively according to how a prediction matches the true label; in RC, the inputs change the activation node values, while the weights are held steady. DL learns until it converges to a steady state; RC never enters a steady state. DL is effective for tasks in which the true hypothesis is relatively fixed; RC is best suited for tasks whose true hypothesis changes chaotically. RC provides a clever improvement on the computationally expensive BP process with respect to temporal prediction tasks. The RC approach allows the network to be flexible, yet employ “massive short-term memory” [30] to estimate a nuanced model of recent samples, while forgetting such nuances from samples of the distant past. One underlying issue with RC is that it is dependent on hyper-parameters that are not tuned during training. Current lines of research focus on tuning these hyper-parameters efficiently [32]. Another weakness of RC is its black-box nature, which limits its ability to predict a complex system without attempting to model its underlying structure.

Early violence detection (EVD), in the field of political science, focuses on dynamic human prediction tasks, but scopes such tasks to make traditional statistical methods of analysis suitable. The white-box nature of EVD allows the direct comparison of a proposed model with political theory. The main approach is to identify a few critical factors associated with geopolitical instability and make predictions with logistic regression and Bayesian methods [67], or other machine learning methods [69]. The approach favors smaller data sets of aggregated data over large data sets of repurposed data. Interestingly, this research has identified network-theoretic power structures as most relevant to conflict [23, 74], and has demonstrated outstanding effectiveness, such as 80% accuracy for two-year forecasts. EVD brings two critical insights to graph classification for human systems. First, good data on the power structures of the systems is the most relevant for global predictions [67]. Second, the dynamics of systems composed of larger organizations tend to change more slowly and more predictably than systems of individuals [68]. Hence, to analyze a highly-dynamic, decentralized system which does not exhibit explicit power structures, one may want to 1) identify the implicit power structures and 2) make such an identification efficient computationally in order to keep apace with changes. Our method seeks to do precisely that.

3. Graph classification in chaotic systems

We seek to bring together the solution concepts for dynamic systems, and cooperative and competitive interactions to improve prediction in the mixed-motive environment. Although the latter two concepts only offer partial solutions to the mixed-motive game, they provide the insight needed to attempt to go beyond simply modeling some low-dimensional input-output signal (as in the Reservoir Computing case) to modeling the internal workings of the system. Essentially, we convert the problem from a purely temporal one to a spatial one, thus combining the various solution concepts in a way that captures the chaotic dynamics ensuing from the mixed-motive interactions, as follows.

1.
Convert event data into dynamic graphs, which are higher-order representations of cooperation and conflict.
2.
Transform dynamic graph measures to the frequency domain, which is a higher-order representation of the evolution of cooperation and conflict.
3.
Embed the frequency domain distributions into a suitable space for classification.

In step 2, graph data (cooperation) is transformed to the frequency domain (conflict) so as to estimate properties of the chaotic attractor (dynamic systems). In step 3, the resulting feature vector includes static data about the graphs (cooperation) and dynamic data that captures the frequency domain distributions (conflict) of the graph data. We neither claim that these are the only solution concepts for cooperation and conflict, nor that this is the optimal way to combine them. We will show, however, that it improves prediction over other approaches in a number of real-world applications.
3.1 Graph representation: Modeling cooperation and conflict

Graphs are the most natural abstraction for the representation of interactions among agents. The structure of the graph is a product of the cooperation and conflict within the system it models, and that structure evolves as the balance of cooperation and conflict changes through time.

We focus on tasks whose corresponding graphs are derived from event data. An event is defined as a quadruple $\{t$ , src, dest, $\textit{val}\}$ where $t$ is a time stamp, src is the initiator of the action, dest is the recipient of the action, and $\textit{val}\in[-1,1]$ is the intensity of the action. The edges of our dynamic graphs are directed from src to dest with weight val. We select a few representative network properties, inspired from the literature, to characterize graphs at each time step, as shown in Table 1. Node-level properties, such as eigenvector centrality, are transformed into graph-level properties by computing the correlation between the list of values of that property and the corresponding list of values of some application-specific notion of node influence. Intuitively, if the correlation is high, then we may assume that the associated property generates high influence, and vice versa. We denote by $S$ the set of structural properties.

Table 1
The feature set $S$ of graph structural properties

Property	Level	Description
Fairness (fair) [39]	Node	Degree of a node’s accuracy in judging Goodness
Goodness (good) [39]	Node	Degree to which a node is trusted by agents with high Fairness
Eigenvector Centrality (eigen) [5, 4]	Graph/Node	Measure of a node’s status in relation to the status of its
		neighbors
Structural Balance (struct) [25, 10]	Graph	Degree to which a friend of a friend is a friend
Assortativity (assort) [51, 45]	Graph	Degree to which highly connected nodes connect with one
		another
Trust Propagation (prop) [24]	Graph	Estimation of the linear transformations of actions to reactions
Clustering Coefficient (clust) [63, 54, 14, 19]	Graph/Node	Degree to which a node’s neighbors are connected to each other
Gini (gini) [11]	Graph	Degree to which influence is unequally distributed among a
		population
s-metric (s-metric) [46]	Graph	Sum of the pairwise products of node degrees

We note that three of the features in $S$ have several variants: eigen has four variants based on the value of its parameter $\beta=\{-1,-.5,.5,1\}$ ; assort consists of 6 combinations of weight considerations ( $+(+,+)$ , $+(+,-)$ , $+(-,-)$ , $-(+,+)$ , $-(+,-)$ , $-(-,-)$ ), where the first terms relates to the edge sign that defines neighbors, the first term in the parentheses refers to the type of signed term for the source and the last term refers to the type of signed term for the target; and prop consists of three strategies, namely direct propagation, co-citation, and tit-for-tat, together with an error term.

3.2 Frequency domain representation: Modeling evolution

In a mixed-motive environment, the elements of cooperation and conflict induce structure and randomness, respectively. For human systems, the additional element of evolutionary pressure induces chaos, resulting in a chaotic trajectory of the system [58].

We require some way to synthesize the notions of cooperation, conflict, and chaos by estimating the level of chaos in the trajectory’s attractor. Just as the graph is a higher-order representation of human interaction, the frequency domain becomes a higher-order representation of how the balance of cooperation and conflict fluctuates dynamically. The frequency domain captures global properties of a dynamic system, one of which is the structure of its trajectory. We therefore propose to capture the trajectory of the structural properties introduced above by converting them to the frequency domain through a Fast Fourier Transform (FFT).

We can then estimate the degree to which the resulting distribution is uniform in order to gauge randomness, or the degree to which it fits a distribution that would represent a structured trajectory. To illustrate the value of this approach, consider Fig. 1, which depicts three different time series and their corresponding FFTs. In addition to the FFTs, the graph shows uniform (flat, dark gray) and gamma (light gray) distribution fits. The first distribution (1a) exhibits high structure, monotonically decreasing at a constant rate. Its FFT (1d), has pronounced amplitudes in two narrow frequency ranges, and as expected does not fit either uniform nor gamma distributions very well. The second distribution (1b) is random, with no underlying structure. Its FFT (1e) exhibits similar randomness, and as such fits an uniform distribution. Finally, the third distribution (1c) represents the Dow Jones daily closing levels over a period of 100 days. It exhibits chaotic behavior (i.e., both some structure and some randomness), and its FFT (1f) shows a range of pronounced amplitudes at the higher frequencies, as well as good fit to a gamma distribution, as expected for such a chaotic trajectory.

Figure 1.

Different time-series and their corresponding FFTs.

We denote by $\textit{FFT}(S_{i},t-w:t)$ the FFT of structural property $S_{i}$ over the time window $[t-w,t]$ , where $w$ is an application-dependent hyperparameter. After fitting each FFT for $S_{i}$ to a gamma distribution, we compute a goodness-of-fit measure $D_{i}$ . We denote by $D$ the set of all of these measures. In addition, we track the interplay of the structural properties’ trajectories by computing a multidimensional Fourier Transform over them, denoted by $\textit{MFFT}(t-w:t)$ . As with the individual trajectories, the resulting function is fit to a gamma distribution. It is also fit to a log normal and an uniform distributions, and goodness-of-fit values are computed. They are denoted by $MD_{g}$ , $MD_{l}$ , and $MD_{u}$ , respectively. We expect the fit of $\textit{MFFT}(t-w:t)$ to not vary from a gamma distribution for long.

3.3 Graph classification algorithm

We now transform our graph time-series into tabular form by transforming each graph $G_{t}$ with the following algorithm.

: GT-CHES Algorithm[1] Extract the values of the properties in $S$ from $G_{t}$ each property $S_{i}$ in $S$ Compute $\textit{FFT}(S_{i},t-w:t)$ . Extract the values of goodness-of-fit $D_{i}$ of $\textit{FFT}(S_{i},t-w:t)$ to the $\Gamma$ distribution Compute $\textit{MFFT}(t-w:t)$ Extract the goodness-of-fit values $MD={MD_{g},MD_{l},MD_{u}}$ of $\textit{MFFT}(t-w:t)$ to the gamma, the log normal, and the uniform distributions, respectively

As in all classification tasks, each graph $G_{t}$ is further labelled with a value from the corresponding application-specific set $Y$ of possible classes. Given the above transformation of graph data into traditional tabular data, where vectors of values drawn from features in $S$ , $D$ and $M D$ are mapped to values in $Y$ , the graph classification algorithm of GT-CHES can now be drawn from the rich pool of candidate classification learning algorithms. We choose to use a bagging ensemble of 10 random forest models, each containing 15 trees.

Note that the proposed transformation allows our applications to still benefit from being framed as graph classification problems, as opposed to just using a feature vector of other related data to classify, or make prediction, at each time step. The graph transformation step allows us to leverage network structure and interaction. Indeed, the feature vector does not consist of aggregated data (e.g., total attacks, number of attacks, etc.) but of graph centrality measures that represent levels of organization and randomness, which are hard to measure with aggregated (non-graph) data.

4. Experiments

We encountered challenges to assessing the value of GT-CHES and comparing its results against those of others. There are very few models that have been designed specifically for the kinds of problems we address. Two fundamental implications follow from this. First, there is a lack of benchmark data sets, where the domain consists of events in a mixed-motive, evolutionary environment, and the data is sufficiently heterogeneous (i.e., directed, weighted, signed, temporal edges). Second, there are a lack of frameworks for making comparisons if suitable data were obtained for graph classification.

We begin with data sets that may be used as proof-of-concept in the sense that they include mixed-motive event-based data as input, but we have to formulate the labels since no graph labels exist for such data sets. Since these are limited, we also design benchmark data sets, which can henceforth be used by others.

4.1 Proof-of-concept data sets

We consider two cryptocurrency networks, namely the Bitcoin Alpha (BCA) and the Bitcoin OTC (BCO) data sets [39, 38]. No governing figure regulates the cryptocurrency, so users self-regulate potential fraudulent or risky behavior by rating each other in terms of trustworthiness. This is crucial in the Bitcoin community since the appeal of the currency is the decentralized nature of transaction management. In this context, the generic game theoretic terms “cooperation” and “conflict” are embodied as trust and distrust, respectively, for Bitcoin users. A user may rate another user only once, and the rating carries through time. Hence, the graph at time $t$ also includes all ratings made prior to $t$ . Ratings normally range in the interval $[-10,10]$ . We map ratings to $\{-1,1\}$ depending on whether they were positive or negative, respectively. In order to obtain a sequence of graphs (i.e., a time-series), we group ratings by week, resulting in 271 graphs and 220 graphs for the BCA and BCO data sets, respectively. The nodes of the graphs are the users of the bitcoin community, and there is an edge between node $i$ and node $j$ in graph $G_{t}$ if user $i$ has rated user $j$ at some time $t^{\prime}\leqslant t$ . For the purposes of computing the set $S$ of structural properties, we define a node’s influence simply as its aggregated ratings, i.e., the sum of ratings it has received from other nodes in the network. Each graph in the series is labeled based on the percentage of weekly negative ratings, using four discrete classes, $Y=\{\texttt{first},\texttt{second},\texttt{third},\texttt{fourth}\}$ corresponding to the four quartiles.

Eurovision is one of the longest-running international songwriting competitions, organized annually since 1956. Each participating country (mainly European) submits an original song that is performed on live television and radio, with competing countries casting votes for the other countries’ songs to determine a winner. Outcomes and statistics from annual contests are collected on the Eurovision’s official web site (eurovision.tv). However, they are difficult to exploit directly. Borsteinn Adalsteinsson has scraped and cleaned the data using an R script, and made it publicly available for download and analysis from his web site (https://data.world/rhubarbarosa/eurovisionvotingstats). Voting values range from 1 to 12, and are based on a complicated voting scheme of jury and televoting. The points are normalized to the range $[0,1]$ , scaled by the maximum value. Note that semi-final events are considered to be separate events. For certain years (e.g., 2008), there are three events (first semi-final, second semi-final, grand final); others have two events (semi-final, grand final); and others just have a single grand final. The nodes of the graphs are the participating countries, and there is an edge between node $i$ and node $j$ when country $i$ assigns points to the song of country $j$ . Since all countries vote for all other countries, all of the graphs are complete. For the purposes of computing the set $S$ of structural properties, we define a node’s influence simply as the total number of points it receives from all other nodes in the graph. Each graph is labeled by the world region in which the winning country is found, namely Balkan (1), Baltic (2), East (11), West (59), and Independent (8).

4.2 Benchmark data sets

In order to better assess GT-CHES and compare it against other relevant approaches, we design novel benchmark data sets. We anticipate that these will be useful beyond our own work and make a valuable contribution to research in graph classification in human, chaotic systems.

The problem of designing an acceptable benchmark reduces to assembling a corpus of data sets consisting of graph-label pairings in which:

1.
A graph represents a human system, where the nodes represent a human organization, and the edges represent relationships.
2.
The graph set is temporal, signed, weighted, and directed. At a minimum, the graph needs to be temporal, but can be signed or weighted.
3.
The label set consists of a labeling for each graph that specifies a property of the system it represents.

Unfortunately, at least one of these elements is missing from extant data sets. The context of the Enron e-mail data set [33], for example, is an evolutionary environment, but the problem formulation is generally link prediction or community detection, rather than graph classification. Similarly, dynamic graph classification, while covering fields such as traffic and weather prediction, is not directly tied to human evolutionary systems. The burgeoning field of social media does cover human systems, but has principally dealt with community detection, or node and link prediction, though recently work has been done on conflict prediction on the web [37]. To circumvent this challenge, we take two complementary approaches, as described below.
4.2.1 Joining graphs and labels from existing separate sources

Here, we draw graph data and label sets from different sources that have been used as inputs and targets, respectively, though in different studies. The pairing of graphs drawn from event data in source $s_{i}$ with labels drawn from source $s_{j}$ is made possible through common encodings for $s_{i}$ and $s_{j}$ . This allows the data sets to be joined through common time stamps, agents, and actions.

We draw event data from the Integrated Crises Early Warning System Project (ICEWS) [6], where events have been extracted automatically from English-language news reports, and consist of interactions between socio-political actors for various regions of the world at various times. Events can thus be represented by a quadruple $\{t,i,j,x\}$ , where $i$ is a geopolitical agent that directs an action towards another geopolitical agent $j$ at time $t$ whose magnitude is captured by $x\in[-10,10]$ . For example, the event “Croatia and Serbia clashed,” taken from a Reuters article of April 14, 1989 is converted to $\{$ 890414, SER, YUGCRO, $-$ 10 $\}$ , where SER is the code for Serbia, YUGCRO is the code for Croatia, and $-$ 10 is the value assigned to a military conflict. For a given data set (i.e., country and time period), we organize events by month and create a graph for each month. There are many event data sets that contain events from several countries. Not all events are included for all countries to avoid the larger nations dominating the graphs. Hence, for a given country $c$ and month $m$ , events are filtered that are relevant to $c$ in $m$ . An event is relevant if $c$ is either the target or source of an event within $m$ . That is, a weighted directed edge $<i,j,x>$ is included in the graph if $t\in m$ , and $i=c$ or $j=c$ . The graphs are generally sparse as there are hundreds of actors in each graph and edges do not persist from one time step to the next.

Graph labels are then drawn from the following sources: Domestic Political Crisis (DPC) [47], Insurgency (INS) [47], International Crisis (IC) [47], Rebellion (REB) [47], Ethnic/Religious Violence (ERV) [47], Irregular Leadership Change (ILC) [2], and State-Based conflict (SB) [49]. The labels come in the form of quadruples year,month,country,class and can thus be naturally joined on month and country with our graphs. The label class is a binary value, defined by an in-country political opposition to government not amounting to an insurgency or a rebellion (DPC); a coordinated effort to overthrow a government (INS); escalating tensions between states/significant deployment of armed forces by one state in another’s territory (IC); seeking independence from a government with ongoing organized, violent actions against it (REB); violence between ethnic or religious groups that is not necessarily related to a government (ERV); unexpected leadership changes in contravention of a state’s established laws and conventions (ILC); and whether the government was at war with any group within or outside the country (SB).

4.2.2 Drawing graphs from recognized sources and creating labels

Here, we draw graph data from known network data sets and create custom labels for each graph. The graphs are the same as those generated in Section 4.2.1 from ICEWS. However, we also use the Computational Events Data System (CEDS) [70] event data sets, which is formatted like ICEWS. While the graph creation process is not novel, the labeling process is, as these graphs represent human evolutionary systems, but there are no labels associated with the graphs.

We then create custom labels for each graph as follows. We focus our attention on levels of violence. Recall that the magnitude of events ranges in $[-10,10]$ . Based on the Conflict and Mediation Event Observations (CAMEO) [22] encoding, values in $[-10,-7]$ are assigned to physically violent events, such as military actions, riots, or coups; values in $(-7,0)$ represent antagonistic diplomatic or non-kinetic actions; and values greater than or equal to 0 are considered constructive politically, materially, or diplomatically.

While the graphs contain all values, we restrict our attention to violent events in the $[-10,-7]$ range to create the graph labels, as follows. Consider a time-series of $n$ graphs. At each time step, we identify the actions with a value $x\in[-10,-7]$ within the graph graph and sum their values. From the $n$ summed violence values, we compute quartiles $Y=\{\texttt{first},\texttt{second},\texttt{third},\texttt{fourth}\}$ , and each graph in the time-series is labeled by the quartile corresponding to its summed violence value. Note that quartiles are estimated separately for each country, as there would be little variance in the levels of violence for smaller countries if estimates were taken over all countries.

For the purposes of computing the set $S$ of structural properties, we define a node’s influence in all of our benchmark graphs as the sum of the magnitudes (i.e., the $x$ values) of the events associated with its incoming links. Table 2 summarizes the data sets used in our experiments.

Table 2
Experiment data sets

Experiment	Graph source	Label source	Number of classes
Proof-of-Concept	BTC	Custom	4
	Eurovision	Eurovision	5
Benchmarks	ICEWS	DPC	2
		INS
		IC
		REB
		ERV
		ILC
		SB
		Custom	4
	CEDS	Custom	4

4.3 Experimental results

To evaluate GT-CHES, we run a number of experiments involving our Proof-of-Concept data sets and our benchmark data sets, against a number of baseline algorithms as well as graph neural networks and state-of-the-art solutions.

4.3.1 Comparison against baselines

To validate our GT-CHES method, we test it against baselines for dynamic systems. These baselines may seem trivial for current machine learning and statistical tasks. However, as mentioned before, many machine learning and statistical tasks assume a stable data distribution, while dynamic tasks assume a shifting data distribution. We use a majority classifier, a naïve classifier, and a no-regret classifier as baselines. The majority classifier predicts the most common label. The naïve classifier predicts at time $t$ the label of the graph at time $t-1$ . Finally, the no-regret classifier chooses dynamically between the majority classifier and the naïve classifier for its prediction at time $t$ , based on which of the two performs better on the training set up to time $t-1$ . We would expect GT-CHES to outperform these baselines if it successfully finds a predictive signal in the data.

The results on the Proof-of-Concept data sets are shown in Table 3. GT-CHES outperforms all three baselines on all three data sets. We note that the Eurovision data set is smaller and highly imbalanced, which explains the good performance of the majority classifier. While the difference is not statistically significant in that case, GT-CHES still achieves higher accuracy.

Table 3
Comparative Accuracy on Proof-of-Concept Data Sets. The asterisk denotes statistical significance for a one-sided $t$ -test at the 0.01 level

Method	BTC alpha	BTC OTC	Eurovision
Naïve	.446	.439	.362
Majority	.207	.204	.400
No-Regret	.436	.421	.400
GT-CHES	.479^*	.507^*	.409

The results on 152 data sets from ICEWS with Custom labeling are summarized in Table 4. Improvement is the percent improvement in accuracy of GT-CHES over the corresponding baseline averaged across all 152 data sets. Win, Loss and Tie refer to the number of data sets on which the accuracy of GT-CHES is higher than, lower than, or tied with, the accuracy of the corresponding baseline. Again, GT-CHES does generally exceed the baselines.

Table 4

Comparative Performance of GT-CHES on ICEWS Benchmark Data Sets. Numbers in parentheses refer to the number of data sets for which the difference is statistically significant for a one-sided $t$ -test at the 0.01 level

	Naïve	Majority	No-Regret
Improvement	1.2%	19.7%	8.1%
Win	82 (50)	120 (106)	113 (90)
Loss	70 (41)	32 (26)	38 (22)
Tie	0	0	1

Interestingly, Fig. 2 also shows that as the level of chaos increases across the 152 ICEWS data sets, as measured by the Hurst Exponent for the associated levels of violence (smaller values indicate greater levels of chaos), the accuracy of GT-CHES generally tends to increase.

Figure 2.

Relationship between accuracy and level of chaos.

4.3.2 Comparison against graph neural networks

To further validate GT-CHES, we compare it against recent methods that have been designed for the explicit handling of graphs, namely (deep) graph neural networks. The two architectures we consider are the Signed Graph Convolutional Network (SigGCN) [16] and a customized architecture we designed, specifically aimed at fusing graph extraction and time-series analysis. The latter formulation incorporates a deep learning (DL) graph method into a DL recurrence method. Instead of a standard input at each time step, the output of the graph DL method becomes the input to the DL recurrence model. The models are fused in that the errors are back-propagated through the DL recurrence methods back to the DL graph methods, and there is one loss function for the entire architecture. The combined model is an LMU-SigGCN model where an LMUFFT [13] takes the estimates of an FFT over a series of SigGCN outputs as the inputs to its classification block.

Table 5
Comparative accuracy of GT-CHES and deep graph neural networks on CEDS data sets

Data set	GT-CHES	SigGCN	LMU-SigGCN
Balkans	.378	.326	.369
Bosnia	.337	.200	.200
Central Asia	.367	.344	.237
China	.400	.250	.333
Cuba	.285	.271	.198
Gulf	.406	.323	.291
Haiti	.328	.315	.281
India	.281	.295	.253
Levant	.352	.265	.233
Somalia	.494	.383	.277
West Africa	.250	.244	.252
Average	.353	.292	.266

It is clear from these results that GT-CHES significantly outperforms existing, and suitably extended, graph-based classification approaches on the selected CEDS data sets. The performance of the LMU-SigGCN is particularly interesting for comparison, because it is so similar to the intuition behind GT-CHES, but it employs back-propagation rather than making a priori game- and network-theoretic assumptions. Furthermore, as it does not rely on error back-propagation, GT-CHES is also significantly faster in terms of training time (results not included).

4.3.3 Comparison against state-of-the-art

Finally, we compare GT-CHES to state-of-the-art results from the EVD community in the context of our benchmarks. We consider two distinct frameworks.

The Political Violence Early-Warning System [26] (ViEWS) created a standardized process for training and testing data for various labeling tasks. In addition, the ViEWS team continually develops and publishes their own models and results. The study we reference here involves predicting state-based violence (SB) for 54 countries in Africa over a period of 36 months. The data is partitioned into training, calibration, and prediction/forecast sets. The training set consists of the country-month data from January 1990 to December 2011. Calibration data from January 2012 through December 2014 may be used to tune parameters, when applicable. The final model is trained on the combined training and calibration periods and used to predict labels for each of the 54 countries from January 2015 through December 2017. One peculiarity of this study is that instead of just predicting one month into the future, we are to make predictions for 36 months in a one-step ahead fashion, so that at time $t$ , only inputs that occurred up to and including $t$ can be used to predict the labels at $\{t+1,t+2,\ldots,t+36\}$ . That is, we build 36 models and use the graph data of December 2014 in each case to predict the labels of the test set from January 2015 to December 2017, as follows. Let $G_{m}$ and $y_{m}$ be the graph data and graph label for month $m$ . To predict the label of January 2015, we train on pairs $(G_{m},y_{m+1})$ for $m\in\{$ January 1990, …, November 2014 $\}$ ; to predict the label of February 2015, we train on pairs $(G_{m},y_{m+2})$ for $m\in\{$ January 1990, …, October 2014 $\}$ ; to predict the label of March 2015, we train on pairs $(G_{m},y_{m+3})$ for $m\in\{$ January 1990, …, September 2014 $\}$ ; and so on. ViEWS’ advocated model is an ensemble of random forest models each based on different sets of carefully crafted (non-graph-inspired) predictive features, such as proportion of months in training period with conflict, population size, proportion of population living in urban areas, GDP growth, and time since regime change. Table 6 shows how GT-CHES compares to ViEWS’ reported results.

Table 6
Comparative average accuracy of GT-CHES and ViEWS on 54 countries

Method/subset	AUC	AUPR	Accuracy	$F_{1}$
ViEWS	.956	.869	.846	.745
GT-CHES	.942	.867	.884	.747

The results show that GT-CHES compares very favorably with ViEWS. It is interesting that ViEWS considers AUPR the most important metric. On that basis, GT-CHES is not significantly different from ViEWS.

The Disruptive Events Prediction [56] (DEP) also utilizes training and testing sets over a similar time span, namely March 2001 through December 2011, and January 2012 through March 2014, respectively. However, there is no one-step ahead prediction. Once the training and test data sets are partitioned to exclude any leakage, they are just treated as ordinary training and test sets, where graph data at time $t$ is used to predict the label at time $t+1$ . Interestingly, country names are excluded, which means that only global models (across countries) can be built. We also note that for this binary classification task, the label set is imbalanced. Hence, we over-sample the targets with the least support in the training set in order to obtain a balanced label set. DEP considers three types of learners: a gated-recurrence unit (GRU), a random forest, and a linear support vector machine (SVM). Again, as in ViEWS, the (non-graph-inspired) predictive features are carefully selected (from an original set of 1,160 features) based on information gain, mutual information, and number of missing values. The features consist of aggregated counts from the Global Database of Events, Language and Tone [44] (GDELT) database, as well as macro-structure metrics from the World Development Indicators1

https:/data.worldbank.org/data-catalog/world-development-indicators.

(WDI) and Worldwide Governance Indicators2

https://data.worldbank.org/data-catalog/worldwide-governance-indicators.

(WGI) published by the World Bank. The features are similar to those used in the ViEWS study (GDP, population, democracy) but the variety and volume differ (including by task). The DPC task has 151 features, the ERV task has 118 features, the INS task has 100 features, the REB task has 135 features, the IC task has 173 features, and the ILC task has 44 features. By comparison, GT-CHES uses 39 graph-inspired features. Table 7 shows how GT-CHES compares to DEP’s reported results.

Table 7

Comparative average accuracy of GT-CHES and DEP

Method/subset	DP	ERV	IC	INS	REB	ILC
GRU	.82	.98	.86	.97	.99	.79
SVM	.74	.81	.74	.90	.88	.54
Random forest	.86	.97	.89	.98	.99	.83
GT-CHES	.74	.83	.76	.90	.87	.57
GT-CHES $+$ WGI	.81	.94	.85	.97	.93	.67

The results show that GT-CHES performs slightly better than the SVM model. It does not beat the GRU nor Random Forest model. However, when we add the 6 WGI features, which are estimations by experts of the level of democracy and corruption, GT-CHES’ performance becomes comparable. We do note that using a country-by-country approach, as in the ViEWS study, would likely be greatly advantageous. Given the importance that geography plays on societal networks, especially in Africa, and the diversity of the geography of the continent [31], we would expect networks to behave differently among the countries.

While GT-CHES does not outperform all methods, it is never the worst either. This is interesting considering the differences in the quality of input data among methods. ViEWS and DEP use macro-structural metrics as inputs, while GT-CHES takes repurposed event-based data. The macro-structural measures have proven to be rather reliable predictors of violence [23, 74], and have so far been superior to event-based data under the current paradigms of violence prediction [67]. This may be explained in part by the repurposed nature of event-based data having possibly lower quality (though higher quantity) than the curated nature of WDI and WGI data. Furthermore, we are very much at the experimental level of formulating graphs properly using event data, as well as identifying structural qualities that are generally good predictors of political violence. GT-CHES is certainly a positive and encouraging step in that direction.

5. Conclusion

We have introduced GT-CHES, an approach for graph classification in the context of human evolutionary systems, based on principles from game theory and chaos theory. We capture explicit structural properties of the graphs at each time step and transform the corresponding time series into the frequency domain. The resulting sets of features are then used by a random forest algorithm to build a predictive model. Results on a number of simple data sets show that GT-CHES matches or exceeds baseline approaches such as a majority learner and a naive classifier. Furthermore, accuracy tends to be increasing as the level of chaos in the system increases. We also compared GT-CHES with deep graph neural networks and showed that GT-CHES outperforms these approaches. This is particularly interesting in the context of very recent findings about the relationships between Fourier analysis and deep learning [72].

Given the relative recency of work in evolutionary graph classification, we designed two real-world benchmark data sets based on well-known geo-political data. These benchmarks are available to the broader community. We showed that GT-CHES performs comparatively to the best published results in political science with approaches based on extensive crafting of predictive features and tuning of models. We expect that tuning of GT-CHES will lead to further improvements. It is also possible that GT-CHES should be viewed as orthogonal to the current macro-structural data approaches. Hence, when used in combination with the current approaches, for example in an ensemble, it may effectively boost performance.

Footnotes

Acknowledgments

We wish to thank Brad Hatch for insightful discussions and help in implementing the GNN models used in our experiments.

Conflict of interest

C. Giraud-Carrier is an Editorial Board Member of this journal, but was not involved in the peer-review process nor had access to any information regarding its peer-review.

References

Axelrod

and Cohen

M.D.

, Harnessing complexity, Basic books, 2008.

Beger

Dorff

C.L.

and Ward

M.D.

, Irregular leadership changes in 2014: Forecasts using ensemble, split-population duration models, International Journal of Forecasting 32(1) (2016), 98–111.

Bengio

Simard

and Frasconi

, Learning long-term dependencies with gradient descent is difficult, IEEE Transactions on Neural Networks 5(2) (1994), 157–166.

Bonacich

, Power and centrality: A family of measures, American Journal of Sociology 92(5) (1987), 1170–1182.

Bonacich

and Lloyd

, Calculating status with negative relations, Social Networks 26(4) (2004), 331–338.

Boschee

Lautenschlager

O’Brien

Shellman

Starz

and Ward

, ICEWS Coded Event Data, 2015.

Boser

B.E.

Guyon

I.M.

and Vapnik

V.N.

, A training algorithm for optimal margin classifiers, in: Proceedings of the Fifth Annual Workshop on Computational Learning Theory, 1992, pp. 144–152.

Bronstein

M.M.

Bruna

LeCun

Szlam

and Vandergheynst

, Geometric deep learning: Going beyond euclidean data, IEEE Signal Processing Magazine 34(4) (2017), 18–42.

Bryson

A.E.

and Ho

Y.-C.

, Applied optimal control: optimization, estimation, and control, Routledge, 2018.

10.

Cartwright

and Harary

, Structural balance: A generalization of heider’s theory. Psychological Review 63(5) (1956), 277.

11.

Ceriani

and Verme

, The origins of the gini index: Extracts from variabilità e mutabilità (1912) by corrado gini, The Journal of Economic Inequality 10(3) (2012), 421–443.

12.

Chen

Wang

and Xu

, GC-LSTM: Graph convolution embedded LSTM for dynamic network link prediction, Applied Intelligence 52(7) (2022), 7513–7528.

13.

Chilkuri

N.R.

and Eliasmith

, Parallelizing legendre memory unit training, in: International Conference on Machine Learning, PMLR, 2021, pp. 1898–1907.

14.

Costantini

and Perugini

, Generalization of clustering coefficients to signed correlation networks, PloS One 9(2) (2014), e88669.

15.

Cui

Zhuang

Liu

and Wang

, Semi-supervised gated spectral convolution on a directed signed network, IEEE Access 8 (2020), 49705–49716.

16.

Derr

and Tang

, Signed graph convolutional networks, in: 2018 IEEE International Conference on Data Mining (ICDM), IEEE, 2018, pp. 929–934.

17.

Easley

and Kleinberg

, Networks, crowds, and markets: reasoning about a highly connected world, Cambridge University Press, 2010.

18.

Epstein

J.M.

, Agent-based computational models and generative social science, Complexity 4(5) (1999), 41–60.

19.

Fagiolo

, Clustering in complex directed networks, Physical Review E 76(2) (2007), 026107.

20.

Freund

and Schapire

R.E.

, A decision-theoretic generalization of on-line learning and an application to boosting, Journal of Computer and System Sciences 55(1) (1997), 119–139.

21.

Gao

and Li

, A graph-based LSTM model for pm2. 5 forecasting, Atmospheric Pollution Research 12(9) (2021), 101150.

22.

Gerner

D.J.

Schrodt

P.A.

Yilmaz

and Abu-Jabr

, Conflict and mediation event observations (cameo): A new event data framework for the analysis of foreign policy interactions, International Studies Association, New Orleans, 2002.

23.

Goldstone

J.A.

Bates

R.H.

Epstein

D.L.

Gurr

T.R.

Lustik

M.B.

Marshall

M.G.

Ulfelder

and Woodward

, A global model for forecasting political instability, American Journal of Political Science 54(1) (2010), 190–208.

24.

Guha

Kumar

Raghavan

and Tomkins

, Propagation of trust and distrust, in: Proceedings of the 13th International Conference on World Wide Web, ACM, 2004, pp. 403–412.

25.

Harary

, On the notion of balance of a signed graph, Michigan Mathematical Journal 2 (1953-54), 143–146.

26.

Hegre

Allansson

Basedau

Colaresi

Croicu

Fjelde

Hoyles

Hultman

Högbladh

Jansen

et al., Views: A political violence early-warning system, Journal of Peace Research 56(2) (2019), 155–174.

27.

Hinton

G.E.

Osindero

and Teh

Y.-W.

, A fast learning algorithm for deep belief nets, Neural Computation 18(7) (2006), 1527–1554.

28.

Hochreiter

and Schmidhuber

, Long short-term memory, Neural Computation 9(8) (1997), 1735–1780.

29.

Jaeger

, The “echo state” approach to analysing and training recurrent neural networks-with an erratum note, Bonn, Germany: German National Research Center for Information Technology GMD Technical Report 148(34) (2001), 13.

30.

Jaeger

and Haas

, Harnessing nonlinearity: Predicting chaotic systems and saving energy in wireless communication, science, 2004.

31.

John

, Africa: a biography of the continent, 1997.

32.

Joy

Mattheakis

and Protopapas

, Rctorch: a pytorch reservoir computing package with automated hyper-parameter optimization, arXiv preprint arXiv:2207.05870, 2022.

33.

Klimt

and Yang

, Introducing the enron corpus, in: CEAS, Vol. 45, 2004, pp. 92–96.

34.

, A graph convolution for signed directed graphs, arXiv preprint arXiv:2208.11511, 2022.

35.

Kriege

N.M.

Johansson

F.D.

and Morris

, A survey on graph kernels, Applied Network Science 5(1) (2020), 1–42.

36.

Krizhevsky

Sutskever

and Hinton

G.E.

, Imagenet classification with deep convolutional neural networks, Advances in neural information processing systems, 2012, 25.

37.

Kumar

Hamilton

W.L.

Leskovec

and Jurafsky

, Community interaction and conflict on the web, in: Proceedings of the 2018 World Wide Web Conference, 2018, pp. 933–943.

38.

Kumar

Hooi

Makhija

Kumar

Faloutsos

and Subrahmanian

, Rev2: Fraudulent user prediction in rating platforms, in: Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, ACM, 2018, pp. 333–341.

39.

Kumar

Spezzano

Subrahmanian

and Faloutsos

, Edge weight prediction in weighted signed networks, in: 2016 IEEE 16th International Conference on Data Mining (ICDM), IEEE, 2016, pp. 221–230.

40.

LeCun

Bengio

and Hinton

, Deep learning, Nature 521(7553) (2015), 436–444.

41.

LeCun

et al., Generalization and network design strategies, Connectionism in Perspective 19 (1989), 143–155.

42.

LeCun

Touresky

Hinton

and Sejnowski

, A theoretical framework for back-propagation, in: Proceedings of the 1988 Connectionist Models Summer School, Vol. 1, 1988, pp. 21–28.

43.

Lee

and Kang

, Self-attention graph pooling, in: International Conference on Machine Learning, PMLR, 2019, pp. 3734–3743.

44.

Leetaru

and Schrodt

P.A.

, Gdelt: Global data on events, location, and tone, 1979–2012, in: ISA Annual Convention, Citeseer, Vol. 2, 2013, pp. 1–49.

45.

A.-W.

Xiao

and Xu

X.-K.

, The family of assortativity coefficients in signed social networks, IEEE Transactions on Computational Social Systems 7(6) (2020), 1460–1468.

46.

Alderson

Doyle

J.C.

and Willinger

, Towards a theory of scale-free graphs: Definition, properties, and implications, Internet Mathematics 2(4) (2005), 431–523.

47.

Lustick

O’Brien

Shellman

Siedlecki

and Ward

, ICEWS Events of Interest Ground Truth Data Set, 2015.

48.

Wang

Aggarwal

C.C.

and Tang

, Graph convolutional networks with eigenpooling, in: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019, pp. 723–731.

49.

Melander

Pettersson

and Themnér

, Organized violence, 1989–2015, Journal of Peace Research 53(5) (2016), 727–742.

50.

Tso

G.K.

Zhou

and Zhu

, Motif-aware temporal gcn for fraud detection in signed cryptocurrency trust networks, arXiv preprint arXiv:2211.13123, 2022.

51.

Newman

M.E.

, Assortative mixing in networks, Physical Review Letters 89(20) (2002), 208701.

52.

Nowak

and Sigmund

, The evolution of stochastic strategies in the prisoner’s dilemma, Acta Applicandae Mathematicae 20(3) (1990), 247–265.

53.

Nowak

M.A.

, Natural cooperation, in: Neurosciences and the Human Person: New Perspectives on Human Activities, Vatican, Pontifical Academy of Sciences, Vol. 121, 2013, pp. 237–241.

54.

Onnela

J.-P.

Saramäki

Kertész

and Kaski

, Intensity and coherence of motifs in weighted complex networks, Physical Review E 71(6) (2005), 065103.

55.

Oreshkin

B.N.

Carpov

Chapados

and Bengio

, N-beats: Neural basis expansion analysis for interpretable time series forecasting, arXiv preprint arXiv:1905.10437, 2019.

56.

Parrish

N.H.

Buczak

A.L.

Zook

J.T.

Howard

J.P.

Ellison

B.J.

and Baugher

B.D.

, Crystal cube: Multidisciplinary approach to disruptive events prediction, in: International Conference on Applied Human Factors and Ergonomics, Springer, 2018, pp. 571–581.

57.

Karimian

and Liu

, A hybrid model for spatiotemporal forecasting of pm2. 5 based on graph convolutional neural network and long short-term memory, Science of the Total Environment 664 (2019), 1–10.

58.

Radzicki

M.J.

, Institutional dynamics, deterministic chaos, and self-organizing systems, Journal of Economic Issues 24(1) (1990), 57–102.

59.

Raghavendra

Sharma

Kumar

et al., Signed link representation in continuous-time dynamic signed networks, arXiv preprint arXiv:2207.03408, 2022.

60.

Rosenstein

M.T.

Collins

J.J.

and De Luca

C.J.

, A practical method for calculating largest lyapunov exponents from small data sets, Physica D: Nonlinear Phenomena 65(1-2) (1993), 117–134.

61.

Rumelhart

D.E.

Hinton

G.E.

and Williams

R.J.

, Learning internal representations by error propagation, Technical report, California Univ San Diego La Jolla Inst for Cognitive Science, 1985.

62.

Salinas

Flunkert

Gasthaus

and Januschowski

, Deepar: Probabilistic forecasting with autoregressive recurrent networks, International Journal of Forecasting 36(3) (2020), 1181–1191.

63.

Saramäki

Kivelä

Onnela

J.-P.

Kaski

and Kertesz

, Generalizations of the clustering coefficient to weighted complex networks, Physical Review E 75(2) (2007), 027105.

64.

Scarselli

Gori

Tsoi

A.C.

Hagenbuchner

and Monfardini

, The graph neural network model, IEEE Transactions on Neural Networks 20(1) (2008), 61–80.

65.

Schelling

T.C.

, The strategy of conflict. prospectus for a reorientation of game theory, Journal of Conflict Resolution 2(3) (1958), 203–264.

66.

Schrauwen

Verstraeten

and Van Campenhout

, An overview of reservoir computing: theory, applications and implementations, in: Proceedings of the 15th European Symposium on Artificial Neural Networks, 2007, pp. 471–482.

67.

Schrodt

, Technical forecasting of political conflict, Indiana University Workshop in Methods, 2020.

68.

Schrodt

P.A.

, Early warning of conflict in southern lebanon using hidden markov models, in: American Political Science Association, 1997.

69.

Schrodt

P.A.

, Seven deadly sins of contemporary quantitative political analysis, Journal of Peace Research 51(2) (2014), 287–300.

70.

Schrodt

P.A.

and Gerner

D.J.

, Analyzing international event data: a handbook of computer-based techniques, University of Kansas, Online Manuscript, http://www.ku.edu/keds/papers.dir/automated.html, 2000.

71.

Smyl

Ranganathan

and Pasqua

, M4 forecasting competition: Introducing a new hybrid es-rnn model, URL: https://eng.uber.com/m4-forecasting-competition, 2018.

72.

Subel

Guan

Chattopadhyay

and Hassanzadeh

, Explaining the physics of transfer learning in data-driven turbulence modeling, PNAS Nexus, 01 2023.

73.

Takens

, Detecting strange attractors in turbulence, in: Dynamical Systems and Turbulence, Warwick 1980, Springer, 1981, pp. 366–381.

74.

Valentino

B.A.

, Final solutions, Cornell University Press, 2013.

75.

Vaswani

Shazeer

Parmar

Uszkoreit

Jones

Gomez

A.N.

Kaiser

Ł.

and Polosukhin

, Attention is all you need, Advances in neural information processing systems, 2017, 30.

76.

von Neumann

, Zur theorie der gesellschaftsspiele, Mathematische Annalen 100 (1928), 295–320.

77.

Wang

and Yu

, Tpgnn: learning high-order information in dynamic graphs via temporal propagation, arXiv preprint arXiv:2210.01171, 2022.

78.

Wei

Zhao

Yao

and He

, Pooling architecture search for graph classification, in: Proceedings of the 30th ACM International Conference on Information & Knowledge Management, 2021, pp. 2091–2100.

79.

Leskovec

and Jegelka

, How powerful are graph neural networks? arXiv preprint arXiv:1810.00826, 2018.

80.

Yao

and Joe-Wong

, Interpretable clustering on dynamic graphs with recurrent graph neural networks, in: AAAI, 2021, pp. 4608–4616.

81.

Ying

You

Morris

Ren

Hamilton

and Leskovec

, Hierarchical graph representation learning with differentiable pooling, Advances in neural information processing systems, 2018, 31.

82.

Zhong

and Huang

, A dynamic graph representation learning based on temporal graph transformer, Alexandria Engineering Journal 63 (2023), 359–369.

GT-CHES: Graph transformation for classification in human evolutionary systems

Abstract

Keywords

1. Introduction

2. Related work

3. Graph classification in chaotic systems

Table 1 The feature set S of graph structural properties

4. Experiments

4.1 Proof-of-concept data sets

4.2 Benchmark data sets

4.2.2 Drawing graphs from recognized sources and creating labels

Table 2 Experiment data sets

4.3.1 Comparison against baselines

Table 3 Comparative Accuracy on Proof-of-Concept Data Sets. The asterisk denotes statistical significance for a one-sided t -test at the 0.01 level

Table 5 Comparative accuracy of GT-CHES and deep graph neural networks on CEDS data sets

Table 6 Comparative average accuracy of GT-CHES and ViEWS on 54 countries

Footnotes

Acknowledgments

Conflict of interest

References

Table 1
The feature set $S$ of graph structural properties

Table 2
Experiment data sets

Table 3
Comparative Accuracy on Proof-of-Concept Data Sets. The asterisk denotes statistical significance for a one-sided $t$ -test at the 0.01 level

Table 5
Comparative accuracy of GT-CHES and deep graph neural networks on CEDS data sets

Table 6
Comparative average accuracy of GT-CHES and ViEWS on 54 countries