Abstract
Although agent-based models (ABMs) have been increasingly accepted in social sciences as a valid tool to formalize theory, propose mechanisms able to recreate regularities, and guide empirical research, we are not aware of any research using ABMs to assess the robustness of our statistical methods. We argue that ABMs can be extremely helpful to assess models when the phenomena under study are complex. As an example, we create an ABM to evaluate the estimation of selection and influence effects by SIENA, a stochastic actor-oriented model proposed by Tom A. B. Snijders and colleagues. It is a prominent network analysis method that has gained popularity during the last 10 years and been applied to estimate selection and influence for a broad range of behaviors and traits such as substance use, delinquency, violence, health, and educational attainment. However, we know little about the conditions for which this method is reliable or the particular biases it might have. The results from our analysis show that selection and influence are estimated by SIENA asymmetrically and that, with very simple assumptions, we can generate data where selection estimates are highly sensitive to misspecification, suggesting caution when interpreting SIENA analyses.
Keywords
Data simulation, as defined by Kéry and Royle (2016), is the “generation of random numbers from a stochastic process that is described by a series of distributional statements.” 1 There are many benefits of simulated data, such as testing the sensitivity of certain model parameters, or examining the robustness of an estimator and the effects of violations of estimator assumptions (Kéry and Royle 2016). 2 The value in using data simulation in the testing and benchmarking of novel analysis methodologies is that the ground truth conditions that generated the data are known. Furthermore, counterfactual experiments can be readily created allowing testing of methodologies against theoretical conditions for which real data are not available.
Typically, data are simulated by first specifying the statistical distributions (including possible covariate interaction) and experimental conditions (such as missing data or observer bias), after which parameter values are selected that result in the desired statistical outcomes. However, when studying problems that involve interactions between actors and nonlinear dependencies, statistical distributions and interactions might be too coarse to evaluate the robustness and validity of a statistical model (Manzo 2014).
In this article, we argue that ABMs are particularly useful to assess models when the phenomena under study is complex, which can be understood as behavior that “cannot be reduced to the simple sum, or aggregation, of individual behaviors” (Hoffer 2013). Agent-based models 3 (ABMs) are effective for the generation of complex simulated data because they are typically designed by specifying the microrules for local interactions which lead to emergent or surprising behavior at the macrolevel (Kéry and Royle 2016; Matthews et al. 2007). Although ABMs have been increasingly accepted as a valid tool in social sciences to formalize theory (Squazzoni 2012), to propose mechanisms able to recreate regularities, and to guide empirical research (Chattoe-Brown 2013; Conte and Paolucci 2014; Macy and Willer 2002), we are unaware of previous research that uses ABMs to assess the robustness of statistical methods.
Through the generation of data that not only have specific desired distributions and interactions, but possibly other emergent structures and correlations, ABMs offer a unique opportunity to evaluate our analytic tools. This approach also allows increased flexibility in specifying the behavior of complex systems by including qualitative insights (Ghorbani, Dijkema, and Schrauwen 2015), formal model learning (Epstein 2006; Squazzoni 2012), and interactions between actors (Gibert and Hamill 2016). As a demonstration of this methodology, we use an ABM to generate data that is then used to assess the Simulation Investigation for Empirical Network Analysis (SIENA) estimation of selection and influence effects. SIENA is a network statistical model to estimate network dynamics for longitudinal network data collected as panel data. This particular example is relevant for several reasons. Firstly, SIENA is a relatively new and prominent methodology employed by a diverse and growing number of publications in social sciences. Secondly, we still know little about its robustness. Thirdly, while some work has been done testing some of SIENA’s assumptions, to our knowledge, no one has tested specific mechanisms for network dynamics (i.e., network selection and social influence). And finally, social dynamics exhibit a number of complexities that are poorly captured by linear models.
We divide the remainder of this article into seven sections. The first section briefly introduces and motivates SIENA and discusses some of its fundamental assumptions. In the second and third section, we describe our ABM and the scenarios used to assess SIENA’s performance. We also discuss the calibration and verification procedures implemented. In the fourth section, we present the SIENA specifications utilized to analyze the generated data, while the fifth section describes our experimental design. We then discuss the results showing the effectiveness of SIENA when estimating selection and influence, address the implications of our results, and motivate the use of ABMs to assess our statistical tools.
SIENA
Background and Motivation
A powerful and pervasive regularity in the social world is the interdependence between social connections and individual traits, behaviors, and attitudes. Those who are proximate or share the same social connections tend to be more similar than those who do not. This phenomenon is usually explained with two different processes: selection and influence. On the one hand, it is argued that the similarity of attributes and behavior is just the result of the selection of network partners, either because we are more likely to contact those who are proximate (i.e., those with whom we share the same geographic location or organizational context) or because we prefer similar peers due to cognitive mechanisms (Bishop 2009; McPherson, Smith-Lovin, and Cook 2001). On the other hand, a long line of investigation suggests that social networks affect and modify individual traits and behaviors as diverse as health, deviance and criminality, norms, educational attainment, smoking, and migration (Christakis and Fowler 2011).
Great effort has been paid to empirically identifying selection and social influence. Identifying which mechanism plays the strongest role can be decisive for understanding the evolution of a social system and, consequently, the success or failure of intervention programs. A policy that is successful in a context where selection is predominant may fail when influence is the main social force of change. Unfortunately, disentangling selection and social influence have proven to be extremely difficult (Shalizi and Thomas 2011; VanderWeele 2011). To infer selection or sorting, we would need dynamic microlevel data that capture changes in individual social networks and connections while behavior remains the same. To infer influence, in turn, we need data measuring changes in behavior while the individual social network remains the same. Furthermore, we would need to account for confounding processes that affect both behavior and network formation.
These challenges have not prevented the development of statistical methods aiming to identify selection and social influence. 4 One of the most recent and prominent efforts to simultaneously model selection and influence is SIENA, a model proposed by Snijders (2001). This method has gained popularity over the years and been used in a number of publications estimating selection and influence for a wide range of behaviors and traits: delinquency (Knecht et al. 2010; Weerman 2011; Weerman, Wilcox, and Sullivan 2017), violence (Dijkstra, Berger, and Lindenberg 2011; Huitsing et al. 2014; Rulison, Gest, and Loken 2013), smoking (Haas and Schaefer 2014; Mercken et al. 2010; Steglich et al. 2012), drinking (Huang et al. 2014; Knecht et al. 2011; Mercken et al. 2012), drugs (Pearson, Sieglich, and Snijders 2006), depression (Schaefer, Kornienko, and Fox 2011; Van Zalk et al. 2010), obesity (Shoham et al. 2012; Simpkins et al. 2013; Zhang et al. 2015), educational achievement (Lomi et al. 2011; Rambaran et al. 2016), and among others.
Despite the increasing number of publications using this method, we know little about its robustness. SIENA estimates selection and influence by reproducing unobserved behavioral and network changes over time. However, we do not know how reliable and successful these imputations (and their assumptions) are. As Edmonds and Meyer (2013) put it, “When this technique (SIENA) is reliable and what its particular biases are, have not yet been established” (p. 729).
There exist simulation-based studies for SIENA, though. For instance, sensitivity analyses on different types of missing data using SIENA have been discussed in Huisman and Snijders (2003) and Huisman and Steglich (2008). Stadtfeld et al. (2017), on the other hand, examine how change of network ties and the composition of actors (leavers and joiners) affect SIENA’s statistical power. However, those studies use the same data generation mechanisms specified by SIENA. To our knowledge, no published work has assessed the robustness of SIENA estimates using alternative mechanisms.
We set out in this example to pose two questions: (1) How sensitive SIENA estimates are to different levels of misspecifications? and (2) Under what circumstances does SIENA yield valid estimates? To answer these questions, we design an ABM that generates synthetic data and reproduces some common aggregate regularities of social networks such as low network density, fat-tail distribution of degree, short paths, reciprocity, and clustering. Then, by controlling the parameters of the ABM (including whether influence or selection is enabled), we explore how well and under what conditions SIENA produces reliable and consistent estimates. Since we never know the optimal specification of SIENA, it is valuable to know how robust its estimates are to misspecification and to identify which of them are particularly consequential.
SIENA Description
Stochastic actor-based models have the primary aim of statistical inference of processes of network and behavior change. 5 Although they can be thought of as ABMs because decisions (i.e., changes in ties and behavior) are made from the perspective of a focal actor, they also include elements of generalized linear statistical models not often seen in ABMs (Snijders and Steglich 2015).
SIENA’s authors argue that their technique overcomes critical limitations of traditional methods. First, SIENA estimates selection and influence as parallel processes using complete network data, enabling feedback between network and behavior changes (i.e., a coevolution model), and adjusting statistically for structural network tendencies, such as reciprocity and transitivity, that might explain the formation or destruction of ties. It is argued that not controlling for such tendencies might misleadingly lead us to infer sorting or assimilation effects (Steglich, Snijders, and Pearson 2010). Second, SIENA addresses the missing data problem that results from using discrete-time panel data. This reminds us of the identification problem of selection and social influence effects when using cross-sectional data. Using simulation to impute unobserved network and behavior changes in a continuous time framework, SIENA’s authors claim to facilitate identification (Steglich et al. 2010; Veenstra and Steglich 2012). However, this strategy does not overcome problems related to latent homophily (i.e., unobserved selection processes, see Shalizi amd Thomas 2011). Using observational data, we cannot exclude the possibility that perceived social influence concerning a variable
SIENA usually models repeated measures of the same network and group of actors, assuming they result from a continuous process of change. In this framework, a big change from one observation to the next is conceived as the accumulation of a sequence of small changes, restricting actors in the model to changing their behavior by a predefined step size, or their social networks by one tie at a time. Since the precise trajectory of small changes from one observation to another is unknown (missing data), the model relies on data augmentation procedures (imputation) and simulation-based inference. It also assumes that actors act independently without coordination. Change needs to occur slowly enough, so actors can make decisions in response to a surrounding network that remains approximately stable. 7
As SIENA actors control their network ties and behavior, they have to determine when to make a decision and which decision to make. Two rate functions are used to determine when they make a decision: one for behavior and one for network. Both are exponential distributions independently estimated from the data.
8
Repeated samples from these rate functions determine the time between decisions. Changes, on the other hand, are defined using deterministic objective functions. These model attractiveness of network states
Both network and behavior objective functions are modeled using linear combinations of effects that represent associations between network, behavior, and exogenous variables. They are defined as:
where
To provide an intuitive explanation of the output SIENA provides, we briefly review a toy example discussed by Steglich et al. (2010). This example includes both homophilous selection and influence (assimilation) effects and uses a behavior variable
The out-degree effect,
The behavior objective function, on the other hand, describes the preference of behavior values. It includes two terms describing an intrinsic behavior preference, and one accounting for the average similarity to neighbors:
In this formulation, behavior is centered and ranges from −2 to +2. The linear and quadratic shape parameters of the intrinsic behavior preference define a parabolic objective function with a maximum centered at the score −1 (i.e., 2 in the original scale). In other words, the distribution of behavior over the long run will be unimodal with a maximum value at 2, indicating the intrinsic optimal behavior value. The odds of moving from the average score of 3 to the optimum score of 2 versus staying at 3 are
Although network and behavior dynamics are being modeled independently, SIENA allows feedback between them. More specifically, both homophilous selection and influence effects are estimated simultaneously in the same model (although in different parts) and are controlled for each other (i.e., they are statistically separated). That is why SIENA’s authors claim their method allows disentangling selection and influence effects.
In sum, SIENA makes inferences on networks and behavior dynamics as if each actor maximizes linear combinations of network statistics, behavior, and covariates. This strategy is intended to include all the available information and overcome the difficulties of fitting and understanding a statistical model, but it does not mean the actual processes follow additivity or linearity, or that people select peers/friends by counting the number triangles or reciprocal ties, or actors have a clear notion of all their social connections and can make complicated computations to assess the short-term consequences of adding, keeping or subtracting ties. These differences between the actual processes and SIENA’s assumptions imply that SIENA's algorithm is impartial to the underlying data-generating microprocesses, provided network/behavior statistics are good summaries of the actual processes and produce valid inferences of small change trajectories.
To assess SIENA’s performance, the ideal would be to use ground-truth data; however, these data are hard to obtain in human populations. We would need fine-grained data with no (or very low) measurement error and be able to manipulate selection and social influence processes. An alternative, and more feasible, strategy is to use a generative model to create dynamic data in which we know the mechanisms at play and the small change trajectories (Epstein 2006). This generative model needs to be different from SIENA but still comparable. For this purpose, we create an ABM of behavior and network dynamics that differs somewhat from the mechanisms assumed by SIENA in order to assess its validity and robustness. At best, our model would be a better analogy of the actual processes, and at worst, would be just as different from the mechanisms specified by SIENA allowing to assess its performance under misspecification.
Influence Selection ABM (ISA): A Network and Behavior Dynamic ABM
We develop an ABM, called the ISA, that generates data matching as closely as possible the requirements of SIENA, allowing us to assess its inferences. It is important that any ABM used to test SIENA not follow its same actor behavior assumptions, as we already know the real processes do not follow SIENA’s algorithm. This is what we mean by misspecification: There is no direct translation between the generative processes of ISA and SIENA, so any SIENA specification would be just an approximation to the actual mechanisms, as occurs when analyzing real data. The question is whether SIENA is able to make accurate statistical inferences given this known difference.
While agent-based modeling cannot by itself solve the empirical estimation and identification problems of the selection and influence effects, it offers a unique opportunity to rigorously implement theoretical and empirical assumptions on these processes. By construction, all group- and individual-level variables defining agent attributes are perfectly known (i.e., there is no unobserved heterogeneity). Also, the way in which agents are linked and the composition of their local neighborhood are also completely transparent.
Our ABM is not intended to perfectly represent the actual processes at play but provides a simplified analogy of potential selection and social influence mechanisms. Although these simplified mechanisms might seem plausible, they are still artificial. If SIENA performs well using a different but simplified version of homophilous selection and social influence, we can argue SIENA’s estimates are robust to particular levels of misspecification (those given by our ABM). This does not confirm SIENA’s ability to produce valid selection and influence estimates in the real world. It does, however, increase our confidence in SIENA estimates, especially if our ABM is empirically appealing. If SIENA performs poorly, we can argue its ability to estimate selection and influence is limited and sensitive to the misspecifications given by our ABM. The next question would be which assumptions seem to be the most critical and how likely they are met in reality.
As we discussed in the previous section, some of the key SIENA assumptions and requirements include: (1) each microstep, consisting of a change to a single network edge or a behavior change of a discrete amount, does not depend on past changes (i.e., no coordination between agents, no memory), (2) the rate of change in network and behavior follows an exponential distribution (i.e., continuous time framework) and is constant across actors, 13 (3) all actors share the same objective function when considering how to change their connections and behavior (i.e., low heterogeneity among actors), and (4) all actors are fully aware of the topology of the network when choosing how to change an tie. In addition, ideal data against which to fit SIENA will have no leavers or joiners, or missing data, though, SIENA can accommodate some of these issues.
With these assumptions in mind, we design ISA following four key design concepts. First, we follow the social circles model proposed by Hamill and Gilbert (2009, 2016) to define the social network dynamic. That model, inspired by Simmel’s (1902) theory, adapts the idea that people are embedded in social circles. It generates a social network by giving each agent a social reach (a radius r defining its social circle) of alter agents and produces personal (ego-centric) networks that are limited in size (low network density), vary in size between individuals, display right-skewed distributions, high clustering (i.e., communities), and are positively assortative by the degree of connectivity (i.e., well-connected agents tend to be connected to other well-connected agents). 14 Second, homophilous selection occurs because agents prefer alters that are more similar in behavior: when creating a tie, they choose the agent with the most similar behavior value. Third, we model social influence using a very simple mechanism: agents choose an influencer among their connections and adjust their behavior to be incrementally more similar. 15 Finally, to explore the consequences of heterogeneity for SIENA’s estimates, we include two important sources of heterogeneity in our ABM: social reach (size of the radius to select peers) and behavioral change independent from social influence.
Model Description
The purpose of our model is to simulate a simplified data-generating process to examine the behavior of the SIENA actor-oriented model, specifically the estimation of influence and selection effects within a changing social network. In order to allow agents to experience influence and selection, agents need to track their behavior and their social connections over time. Therefore, the primary characteristics of agents are their behavior (measured using a continuous scale from 0 to 1) and social contacts (connections that form their out-degree). Secondary characteristics are position in space and social circle radius (to capture new potential alters or destroy existing ties).
Agent actions include movement, creation/destruction of ties, and interaction. Regarding movement, each agent has the same step size
In parallel, at a rate defined by

Network change action chart.
At a rate defined by
where b is behavior of the (e)go or the (a)lter, and
Agents are also allowed to make behavior and network changes independently from influence/selection, so that dynamic behavior and networks can still observed when influence or selection are disabled during the experiments. Independent behavior change is controlled by a separate rate,
The initial population is of 200 agents, randomly placed in a 500 × 500 unit toroidal space. Model time is continuous, meaning there is no fundamental clock “tick.” 16 The initial social network is created by connecting each agent to two other randomly chosen agents. Both the social circle and initial behavior are defined using uniform distributions. For each agent, the chance of an increasing versus decreasing behavior tendency is 50 percent. Table 1 describes all the model parameters with their initialization values.
ISA Parameters.
Note: For distributions in the value column, these are resampled for each agent. Initial social network is created by connecting each agent to two other randomly chosen agents. uni(x, y) = a sample from a uniform distribution between x and y; rand(x) = true with a probability of x; ISA = influence selection ABM; ABM = agent-based model.
To generate the output data, the model simulates a measurement, outputting location, social network, and agent behavior data for each agent every
ISA Scenarios, Calibration, and Verification
We generate data for six ABM scenarios. Table 2 displays their differences. While ISA calculates a strength value for each influence and selection, we have not defined a mechanism to either scale the strength of influence or selection, nor have we defined a method to calculate a strength comparable to SIENA (i.e., there is no a direct translation of ISA’s parameters and SIENA coefficients). 18 As a result, when comparing ground truth with SIENA predictions, we measure simply presence or absence. Therefore, we exclude scenarios where selection and social influence are at play simultaneously. Table 3 displays descriptive statistics of the outcomes of 100 replications of each scenario (details of the experimental design are discussed in Experimental Design section).
ISA Scenarios.
Note: B = baseline; S = selection; I = influence; CR = constant radius; ISA = influence selection ABM; ABM = agent-based model.
Descriptives by ISA Scenario (100 Replicates).
Note: Standard deviation is given in parentheses. B = baseline; S = selection; I = influence; CR = constant radius; ISA = influence selection ABM; ABM = agent-based model.
The first scenario (B) corresponds to our baseline where both (homophilous) selection and influence are disabled, and there is heterogeneity in agents’ social circles. This was done to ensure the model does not artificially generate nonzero selection or influence (measured by behavioral-network autocorrelation). Table 3 provides (network) statistics that characterize each scenario (averaged over four waves). Scenario B shows a network density of 0.03, average degree of 10.28, 19 and high levels of reciprocity and transitivity.
To measure for any unintended selection or influence pressures, we use one of the most popular standardized measures for network autocorrelation, Moran’s I:
where n is the number of agents,
In order for SIENA estimates to be valid, SIENA’ authors indicate that the network must have a minimum amount of network stability over time. SIENA’s authors measure this using Jaccard index, that is, Jaccard distances between time-successive networks:
where
The ISA scenarios I and S enable influence or selection, respectively. The influence mechanism does not directly affect network dynamics, so network statistics between scenarios B and I are almost identical, as expected. This is not the case for scenario S, where ties are being defined through a selection process while behavior remains unaffected. As expected, network statistics are different from scenario B, slightly increasing the degree and reducing reciprocity, transitivity, and the Jaccard index. In scenario I, Moran’s coefficient for behavior is .22, and behavior stability is lower than scenario B because behavior is changing in a systematic way and not only through a random process (behavior tendency). For scenario S, Moran’s coefficient is .22, the same as scenario I but less noisy (i.e., smaller standard deviation). Importantly, both influence and selection mechanisms independently increase network autocorrelation.
In order to examine the consequences of agent heterogeneity, scenarios B + CR, I + CR, and S + CR (CR stands for constant radius) set all agents’ social circles to the same value (50). This changes the network structure, increasing degree, reciprocity, and transitivity. In other words, agents are more connected, increasing Moran’s coefficient for scenario I + CR (there are more chances of being influenced) and decreasing that coefficient in scenario S + CR (relatively less clustering of behavior).
How Different Is ISA from SIENA?
Along with the four SIENA’s assumptions discussed above, we examine how different SIENA and ISA are. First, ISA replicates some SIENA assumptions such as (1) a continuous time framework for changes in the network and behavior and (2) the speed of those changes coming from exponential distributions. However, our implementation of change is different. We set several and different rates for agents’ actions such as movement, change of ties, interaction between agents, and modification of behavior. SIENA, on the other hand, estimates one rate for network change and another for behavior modifications. In addition, while for SIENA keeping ties or no change is a valid choice in a discrete choice model, we do not define stability explicitly, but by the time between agents’ actions (e.g., moving/not moving, changing a tie/not changing a tie).
Second, ISA does not satisfy the assumptions of (3) agent heterogeneity or (4) awareness of the network topology. Agents in ISA are heterogeneous along dimensions that impact their network decisions (to create, keep, or destroy ties), and they do not use or have awareness of the complete network topology. They only use information from their agents in their social circles and social networks to create and destroy ties. Social circles vary by agent and their differences can be thought as heterogeneous agents’ propensities to send outgoing ties: given a particular neighborhood those agents with a bigger social circle are more likely to establish connections.
We think violations to assumptions (3) and (4) are not eccentric and it is worth examining their consequences. People are likely to have individual and contextual (unobserved) differences when defining their social networks. In addition, since our sensing and intelligence are limited, it seems reasonable to explore how sensitive SIENA is to alternative micromechanisms of network generation that do not assume perfect information and extraordinary computing abilities (Epstein 2014).
Finally, we should note that ISA generates perfect data, that is, no missing records, no compositional change or measurement error, and does not include any latent homophily process that might confound estimates of selection and influence.
SIENA Specification
Table 4 outlines the effects we include in the SIENA specifications. Each effect is placed within a specification grouping (column 2) to facilitate comparisons between experimental scenarios, as described below. The structural effects include network dynamics at the individual, dyad, and triad (group) level. We chose these effects through an iterative process in which we assess the goodness of fit (GOF) of our models and review the most common effects used in empirical studies. 20
Siena Effects by Specification.
Note: The density effect is allowed to vary by wave in all the specifications. N = network structural effects; S = selection; I = influence; D = distance; R = radius; SIENA = Stochastic Actor-Oriented Model.
Below that, we include two effects directly related to ISA’s generative mechanism: distance and ego radius. The distance effect expresses the extent to which a tie between two actors is less likely when the dyadic distance is larger. This coefficient in our analyses is, as expected, negative and statistically significant. The ego radius effect expresses how actors with larger social circles extend more outgoing ties. The coefficient of this effect is, as expected, positive and significant. We decided to include the radius information only through the ego agent. Effects such a alter radius (those with higher radius have more incoming ties) and radius similarity (actors select peers based on radius) do not seem reasonable given the generative mechanisms of our ABM. 21
Following SIENA authors’ recommendations, we use three selection parameters: ego behavior, alter behavior, and behavior similarity. The ego and alter effects are not significant in our analysis, suggesting no main effect of behavior on either network nominating activity or popularity. Behavior similarity (what we call selection), in turn, indicates descriptive evidence for homophilous selection based on behavior.
The behavioral dynamic part of the model is simpler. Two shape parameters are used to express basic distributional features of the behavior variable and as control variables. The linear shape parameter plays the role of an intercept, whereas the quadratic shape parameter controls for underdispersion (regression to the mean, when the parameter is negative) or overdispersion (polarization, when the parameter is positive). Together, the two shape effects can be interpreted as expressing the relative prominence of the specific values on their distribution. If the shape contribution to the objective function is U shaped, for instance, it suggests that changes in behavior are drawn to the extremes of the range, decreasing behavior of actors with low behavior, and increasing those with high behavior. SIENA’s authors recommend the quadratic shape effect when the behavior variable contains three or more possible values (Veenstra and Steglich 2012). We also adjust for the independent behavior change choice, a binary value (0/1) that defines different triangular distributions to modify behavior. Finally, we use the average alter effect to parameterize social influence. This coefficient suggests that actors whose alters have a higher average value of behavior will, themselves, also have a tendency toward higher behavior. 22
Table 5 summarizes the seven SIENA specifications we use, composed of the effect groupings indicated above and in column 2 of Table 4. We include selection (S) and influence (I) effects separately and together to explore how these parameters affected each other. We do similarly with radius (R) and distance (D) and add a model where the structural network part included only density (out-degree). Thus, we provide a relatively diverse set of specifications.
Siena Specifications.
Note: SIENA = Stochastic Actor-Oriented Model; N = network structural effects; S = selection; I = influence; D = distance; R = radius.
To fit each of these SIENA specifications, we took four measurements for each of the ABM scenarios (corresponding to model day 180, 210, 240, and 270) in order to ensure enough statistical power (Stadtfeld et al. 2017). 23 Our behavior variable is continuous, but SIENA currently only allows ordinal or categorical values when modeling behavior as a dependent variable. We converted our continuous behavior variable into one of 10 discrete levels (empirical applications usually use dependent variables with 4–10 categories).
Experimental Design
Table 6 shows all relevant combinations of the six ISA scenarios and the seven SIENA specifications, resulting in 18 experimental scenarios. As mentioned above, each of the six ISA scenarios is run 100 times to allow the Moran’s I network autocorrelation to stabilize. 24 Each ISA run outputs the social and behavioral landscape at four measurement times, which are then fed into the respective SIENA specification. This results in fitting SIENA 1,800 times, 100 for each of the 18 experimental scenarios. Those models that reach overall convergence are kept and included in Table 6. 25
Coverage Rate, Bias, and SIENA Estimates by Scenario and Specification (100 Replicates).
Note: ISA scenario: B = baseline; I = influence; S = selection; CR = constant radius. Siena specification: N = network structural effects; S = selection; I = influence; D = distance; R = radius. Performance statistics:
To evaluate SIENA's performance, we use two measures: standardized bias and coverage rate. Standardized bias is the raw bias (average parameter estimates across replications minus the true parameter value) divided by the standard deviation of the estimates across all replicates. For instance, a value of −0.5 means that the estimate on average falls one-half of a standard deviation below the parameter. Collins, Schafer, and Kam (2001) provide the rule of thumb that absolute values of 0.4 or higher are practically significant. Coverage rates are the proportion of replications whose 95 percent confidence interval includes the true parameter estimate. The actual coverage should be approximately equal to or greater than the nominal coverage rate (95 percent). According to (Collins et al. 2001), problematic coverage rates are below 90 percent. We also estimate aggregate coefficients for selection and influence effects by using meta-analysis (An 2015; Viechtbauer 2010), so that we can compare the magnitude of effects across SIENA specifications and estimate the probability of observing a positive effect
To assess the GOF of our models, we employ distributions of auxiliary statistics such as out-degree, in-degree, geodesic distance, triad census, and behavior and explore the differences (Mahalonobis distance) between the observed distributions (summed across the four waves of data) and the simulated distributions from SIENA (summed across 1,000 random networks). Most of our specifications reproduce well the degree and behavior distributions. GOF for geodesic distances and triad census, in contrast, is weak. The connection between bias and GOF, though, is not straightforward. While some of our best SIENA estimates (e.g., S + I + R + D) have poor GOF with statistics such as geodesic distance and triad census, specifications with a similar (or even better) GOF appear to have more biased estimates (e.g., N + S + I). Section GOF in the Supplemental Material (which can be found at http://smr.sagepub.com/supplemental/) provides details on the GOF of our models, although more research would be needed to explore the connection between bias and GOF in SIENA.
Results
Table 6 displays coverage rates, standardized biases by scenario and specification, and pooled SIENA estimates. We begin by examining ISA scenario B where selection and influence are disabled. Using the SIENA specification N + S + I (adjusting for structural network effects, selection, and influence), we find that selection bias is expectedly below the 0.40 threshold (0.16), but coverage is below 0.90 (0.85). Furthermore, the selection coefficient, although very close to zero, is statistically significant using standard hypothesis testing procedures. The influence estimate, on the other hand, shows no bias, good coverage, and RE estimate consistent with no influence effects. In other words, while N + S + I correctly reports no influence, the selection estimate is a bit more uncertain.
ISA scenario I enables the influence mechanism. This time, we should expect positive estimates of influence and close-to-zero estimates of selection. As discussed above, we use six SIENA specifications. The first one, N + S, adjusts only for structural network effects and homophilous selection. As expected, this specification shows high bias in the selection estimate as that coefficient does not account for influence, the actual mechanism generating network autocorrelation (as shown in Table 3). We then consider the specification N + S + I that adds an influence effect. In this case, we might expect a more accurate estimation of selection since SIENA is now allowed to adjust for influence. However, allowing SIENA to account for influence (N + S + I) does not improve the selection estimate. Bias remains practically the same, and the pooled selection estimate becomes slightly higher, while the influence estimate is positive and statistically significant. 26
We progressively continue adding information on the mechanisms of the network generation to the SIENA specification. N + S + I + D, for instance, adds a distance between dyads effect. Under this specification, bias is reduced (from 1.73 to 1.31), coverage increases (from 0.44 to 0.64), and the pooled selection coefficient decreases from 0.68 to 0.43. Similarly, N + S + I + R, incorporates a ego radius effect and reduces bias even more (from 1.73 to 0.93). Although selection coefficients remain positive and significant when we expect zero effect, we observe that including information of the generative network formation improves selection estimates.
Then, we consider including both ego radius and distance together using two different network specifications. First, we only adjust for radius and distance and remove all the structural network effects in SIENA except for density (S + I + R + D). In this case, bias decreases to 0.25 (below the 0.40 threshold), coverage remains below 0.90, and the pooled selection estimate is positive and statistically significant. Second, we add the structural network effects back (N + S + I + R + D) and adjusts for radius and distance simultaneously. Here, the general results are the same: the selection bias is below the threshold (0.23), coverage is slightly under 0.90, and the pooled estimate is positive and statistically significant under conventional levels (
All of these experiments were estimated on ground-truth data generated from ISA every 30 days. As discussed in Section S4, we conducted further comparisons varying instead this update period, using the N + S + I SIENA specification throughout. Choosing the shortest update period of five days resulted in coverage, bias, and pooled selection estimates better than all SIENA specifications other than S + I + R + D and N + S + I + R + D. This speaks to the importance of checking the sensitivity of estimates on the measurement updates period.
ISA scenario S enables selection instead of influence. As can be seen in Table 6, standardized biases for influence are all around −0.25, regardless of the SIENA specification, coverage rates never go below 0.98, and pooled coefficients are very close to zero (−0.01). In all cases, influence is correctly measured. Moreover, influence estimates do not change when including/excluding selection effects, similar to what we observed in ISA scenario I.
We finally examine the consequences of heterogeneity. The scenario designation CR removes radius heterogeneity by making social circles constant (i.e., all agents have the same radius). SIENA estimation using the baseline scenario (B + CR) shows biases and coverage rates consistent with no selection and influence effects, better than the baseline model with heterogeneity. Removing heterogeneity also improves selection bias when influence is enabled (I + CR), although selection coefficients are still positive and statistically significant under the N + S + I SIENA specification (the pooled selection effect is 0.19 with an standard error of 0.04). Adjusting for distance (N + S + I + D) results in a sufficiently high coverage (0.92), a sufficiently low selection bias (0.06) and a statistically insignificant selection estimate. With a homogeneous radius and selection enabled (ISA scenario S + CR), influence estimates are again correct and stable: bias ranges around −0.20, the coverage rates are slightly over 0.90, and the pooled estimates are close to zero (−0.01). Evidently, heterogeneity in the network generation process has strong consequences for the estimation of selection using SIENA, even when accounting for that source of heterogeneity (e.g., adjusting for variables such as ego radius).
Conclusion and Discussion
The goal of this article is to demonstrate how ABM-generated simulated data can provide novel and valuable insight into the testing of new methodologies. This was demonstrated with an ABM built to generate test data for the SIENA analysis tool. Specifically, we explore how sensitive SIENA’s selection and influence estimates are to misspecification and examine the conditions under which SIENA provides correct estimates.
The primary result from this work is that, while influence estimates were consistently robust to variation in SIENA specification and across all ISA scenarios, selection estimates were highly sensitive. As we increased the complexity of the SIENA specification, the selection bias did improve, however in order to obtain accurate estimates of non-selection, we needed to both incorporate knowledge of the data-generating process itself (by adding social circles and social distance to the SIENA specification) and reduce the heterogeneity in the ISA population (by enforcing a constant social radius). This is an important conclusion for two reasons. Firstly, SIENA claims to be able to obtain accurate selection estimates conditional on a model specification that is a good approximation to the data-generating process. Knowledge on the data-generating process is typically unavailable and including parameters that has the potential to correctly describe the network dynamic seems not to be enough to recover the true parameters. Secondly, social networks are marked by a great deal of heterogeneity. This latter result was expected given that SIENA assumes a homogeneous objective function when defining ties and behavior evolution. This method, however, demonstrates a way to quantify the degree of sensitivity to this assumption, allowing researchers to better understand when SIENA estimates might be overly biased.
In describing our hypothesis as to why selection is often overestimated while influence is robust, it is helpful to refer to Figure 2. It displays different dyad combinations and how they can change over time. This figure provides a very intuitive understanding of how SIENA proposes to separate influence from selection. If we measure a dyad in the state on the right in one measurement and the bottom in the next, it is thought to be easier to follow path {d} than path {c, b, a}; therefore, selection should be easily distinguishable from the confounding effects. However, with random behavior change as in our ISA scenarios, step {c} (from right to top) would be very likely since random behavior changes are more likely to make any nonconnected pair of similar agents less similar. Once in the top state (two dissimilar disconnected actors), step {b} is more likely for the same reason: randomly moving closer in behavior is less likely than randomly connecting. Finally, if influence is enabled in the ABM, step {a} from left to bottom is expected. This shows how selection could be observed when no selection is actually present.

Configurations of similarity and friendship in a dyad. Start with the first left connection and follow through path (b, c, and d). This selection-only path is more complex than the influence path of simply taking (a). Hence, measurement of influence when it is not present is unlikely. Source: Veenstra and Steglich (2012).
As to why influence measures are robust, we start the same type of argument at the left configuration of Figure 2 (two dissimilar connected actors). Being able to accurately measure influence is akin to distinguishing between path {a} and path {b, c, d}. For similar reasons as above, step {c} (top to right) is relatively unlikely due to ISA’s random behavior change. Therefore, path {b, c, d}, which is a confounding path for measuring influence, is much less common than {a}, which is the path measuring influence, whereas path {c, b, a}, confounding for selection, could be nearly comparable to path {d}, true selection. This would result in influence estimates being much more robust than selection estimates.
A simple test to check this interpretation is to add a social norms mechanism to our model, such that step {c} becomes more equally likely from either direction, and see if the influence stability remains. In the Section S3, we provide evidence that supports this interpretation as influence becomes positive when adding social norms. These results also provide interesting insights regarding how independent changes in behavior can affect the robustness of selection and influence estimates.
Although SIENA allows feedback between selection and influence processes, the analysis of the data generated by our ABM suggests that SIENA measures selection and influence almost independently. In ISA scenario I, for instance, the estimation of selection remains unchanged when we add the influence effect. This might be also explained using Figure 2. As we discuss above, given two discrete data points (i.e., waves), inferring (unobserved) selection and influence trajectories at the same time seems very unlikely. In other words, simulated microchanges tend to prioritize either selection or influence inferences when imputing unobserved changes given two data points. If covariance between selection and influence inferences is close to zero, adjusting for each other will not have any consequences for the estimates. To fully validate this hypothesis, we would need more research, especially since selection is not well captured in many of our experimental scenarios, for reasons discussed above.
In sum, we recommend caution when using SIENA estimates. Our simulated ground-truth uses data generated from a simple ABM where no measurement error, missing or censored data, panel attrition or compositional change is present. Also, no latent homophily affecting selection or influence estimates was included. Analysis with real data might be even more challenging, and additional sources of bias might be overlaid on top of the conditions explored in this article. If we suspect that the problem we are studying is context dependent and heterogeneous, and we only have partial knowledge or no good theory on the key mechanisms that generates the network dynamic, the relative importance of selection and influence estimates provided by SIENA should be interpreted with extreme caution.
It could be argued that our ABM scenarios are too simple to be a valuable comparison against SIENA estimates. For example, social networks likely are very rarely, if ever, devoid of selection effects. How valuable, then, are discrepancies between SIENA estimates and ISA ground truth in light of this oversimplification? In response, we would point out that this research is not meant as the final word on SIENA’s limitations; certainly more work is required, work that includes a strength measure for each effect, thereby allowing a wider array of experimental scenarios for the ABM. As is typical when conducting “benchmarking” tests, it is wise to begin at the extremes of simplicity to allow for clear thinking of the mechanisms. This argument to simplicity does not invalidate the conclusion that SIENA appears to treat selection and influence asymmetrically.
Relatedly, while the SIENA estimates in this article are specific to how ISA was designed, it should be noted that the conclusions are potentially broadly applicable. The micromechanisms in ISA are certainly an oversimplification of real-world selection and influence processes. However, they were chosen to generate specific key population-level phenomena observed in the real world (such as social clustering). While the Generativist Manifesto (Epstein 2006) says that generating a macrobehavior is not sufficient to explaining the real-world mechanisms, it is nonetheless a necessary element. Since this plausible, though simple, mechanism demonstrates surprising sensitivity in SIENA estimates, further exploration under different generating mechanisms is warranted.
There are many future research paths suggested by this work. We could include more complex and realistic mechanisms to better understand robust or sensitive parameter regimes of SIENA. For instance, this work could be extended to study the impact of time, to observe when or how often we might want to take measurements, and how long SIENA’s predictions might remain valid. It would be also useful to examine how sensitive selection and influence SIENA estimates are to latent homophily. And finally, this approach would be very powerful when studying the effects of noise in the survey data. Independent changes in behavior affects influence estimates opening the question on how noise (e.g., measurement error, stochasticity) might bias downward social influence estimation when using SIENA. In this scenario, it would be of interest, for example, to examine whether it is more important to focus on accuracy in the social network or in the behavioral measure.
In conclusion, in order to be able to build confidence in our analysis techniques, especially when applied to complex and dynamic systems, it is important to be able to validate them against a diverse set of mechanisms that might plausibly represent real-world situations. With the use of modeling techniques such as ABM, a wide range of previously difficult problems becomes much more accessible. We have conducted a first step in this direction, but much more is to be done to assess complex statistical models like SIENA.
Supplemental Material
supplemental_material_resubmission - Agent-Based Models for Assessing Complex Statistical Models: An Example Evaluating Selection and Social Influence Estimates from SIENA
supplemental_material_resubmission for Agent-Based Models for Assessing Complex Statistical Models: An Example Evaluating Selection and Social Influence Estimates from SIENA by Sebastian Daza, and L. Kurt Kreuger in Sociological Methods & Research
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The University of Wisconsin–Madison researchers are supported by core grants to the Center for Demography and Ecology, University of Wisconsin (R24 HD047873) and to the Center for Demography of Health and Aging, University of Wisconsin (P30 AG017266).
Supplemental Material
Supplemental material for this article is available online.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
