Abstract
Researchers examine contrasts between analysis of variance (ANOVA) effects but seldom contrasts between regression coefficients even though such coefficients are an ANOVA generalization. Regression weight contrasts can be analyzed by reparameterizing the linear model. Two pairwise contrast models are developed for the study of qualitative differences among predictors. One leads to tests of null hypotheses that the regression weight for a reference predictor equals each of the other weights. The second involves ordered predictors and null hypotheses that the weight for a predictor equals that for the variables just above or below in the ordering. As illustration, qualitative differences in high school math course content are related to math achievement. The models facilitate the study of qualitative differences among predictors and the allocation of resources. They also readily generalize to moderated, hierarchical, and generalized linear forms.
Keywords
Within the analysis of variance (ANOVA) context, hypotheses about contrasts between effects are common. While ANOVA is a special case of the general linear model where the regression weights correspond to effects, there is a smaller literature on testing hypotheses about contrasts between regression weights in a single group (Buse, 1982; Carlton-Ford, 1993; Davison et al., 2023; Davison et al., 2020; Engle, 1984; Evans & Savin, 1982; Graybill, 1976; Kiviet, 1986; Rindskopf, 1984). This literature has at least two camps. The first involves a penalty function (Buse, 1982; Engle, 1984; Evans & Savin, 1982; Kiviet, 1986). This approach adopts maximum likelihood optimization and χ2 test statistics. The second approach, called the substitution approach by Davison et al. (2020), uses ordinary least squares. It is named substitution because it uses a reparameterized linear model that replaces predictors by proxies that are linear combinations of the original predictors. Weights in the reparameterized model correspond to contrasts between the original weights. While first developed for linear regression (Carlton-Ford, 1993; Davison et al., 2020; Rindskopf, 1984), this approach was extended to generalized linear models for categorical criteria (Davison, Jew, et al., 2014) and latent variable modeling (Davison, Chang, et al., 2014). Here, we give two reparameterizations of the model, in which parameters are contrasts between regression weights. We also do a simulation study comparing Type I error and power for the Lagrange multiplier (LM) penalty and substitution methods.
One of our interests is testing the hypothesis of equality for a linear weight contrast of two predictor variables
In sum, this is akin to asking whether predictors differ qualitatively relative to the criterion.
What we propose will likely be of special interest when the predictors are in common units. For instance, calories come in carbohydrates, fats, and proteins. In a study of weight loss, one may be interested in whether a reduction of one calorie is associated with the same expected weight loss regardless of the source of the calorie. Cooper et al. (1992) studied whether bone loss from caffeine varied with the source of caffeine: coffee, tea, or other. Stated as a regression problem, when bone loss is regressed onto coffee caffeine, tea caffeine, and other caffeine; does one unit increase of caffeine lead to the same expected bone loss irrespective of its source? Below, we regressed high school math achievement test scores onto the number of instructional units taken by students in ten high school math subject areas (e.g., algebra 1, trigonometry, statistics). The research question is whether a unit of math instruction is associated with the same expected change in math achievement irrespective of subject matter? The question of whether the effect of a unit change in predictor is equivalent for the criterion is of special interest when the predictors are measured in the same unit.
Our goal is to extend the substitution approach for pairwise contrasts with more than two predictors. These models can be of interest for allocating psychological resources. One may wish to allocate resources to get more “bang for the buck,” which suggests qualitative differences between the resources relative to a criterion. The models we present also have inherent theoretical meaning. It is easier to explain the reparameterized model as one of resource allocation after deriving the model. Thus, we first explain the model.
The reparameterized model can be extended to a moderated model for testing interactions between the contrasts and person variables, such as sex or age. It can also be used to study the effect on the variance accounted for by deleting predictors. Deleting a predictor X v from the ordinary model will generally have a different effect on R 2 than will deleting that same variable from the reparameterized model obtained through substitution. The model also readily extends to generalized linear and hierarchical forms. It can be used with categorical dependent variables to test hypotheses about regression weight contrasts. None of this requires specialized software.
Substitution Approach
Substitution involves replacing predictors in the usual regression model with proxies, so weights on some predictors in the new model are contrasts between the pairs of weights in the original model. Each proxy variable is a linear combination of the original predictors. Rindkopf (1984) showed how to reparameterize the usual regression model, so the resulting parameters satisfy specified, linear constraints. Carlton-Ford (1993) used Rindkopf’s idea to reparameterize the original model for two predictors to test whether the two regression weights are equal. There are many ways to extend Carlton-Ford’s approach to more variables and we present two. The first we call the referent approach where one predictor in the original model (e.g., X1 ) is the referent, and the others are focal variables, predictors 2,…, V. The referent could be a standard or control condition. This formulation leads to regression weights in the reparameterized model that are contrasts between weights in the original model b v − b 1 for all focal variables v >1. For the second approach, ordered predictor approach, weights in the reparameterized model are contrasts b v − b v−1 from the original model. This formulation presumes an ordering of predictors. The ordering may be from theory, as in our following example. There are infinite reparameterizations possible with our two being potentially theoretically meaningful.
Here, we describe the substitution approach for four predictors and extensions to more. We show the referent and then the ordered approach. For both, it is posited that qualitative differences between predictors moderate the size of the predictor weights. For instance, if the predictors are calories, we posit that the expected change in the criterion resulting from increased calories varies depending on the source of the calorie. Thus, qualitative differences in carbohydrates, fats, and proteins moderate the expected change in weight. An important aspect of our approach is that it allows one to quantify effects associated with qualitative features of predictors.
Referent Substitution Approach
The following equations are in scalar form, as will be our proofs
where Y is the criterion, bv
is the regression weight for predictor v, Xv
is the predictor v, a is the intercept, and e is the error. Let X
1 be the reference variable, and the remaining predictors be focal variables. To reparameterize Equation 2a for the referent approach, we add and subtract
If we let T
1 be the sum of all predictor variables (1 to V), the first term on the right of Equation 2d becomes
Fitting Equation 2f involves regressing Y onto
Referent substitution is similar to reference cell coding in ANOVA. If for four groups, we let Group 1 be the referent and then let: X2
= 1 when Group = 2 and 0 otherwise, X3
= 1 when Group = 3 and 0 otherwise, and X4
= 1 when Group = 4 and 0 otherwise. Using Equation 2a, the predictive model is:
The weights b 2, b 3, and b 4 are mean contrasts between each group and Group 1. These weights are significant if the focal group’s mean differs from the referent group’s mean.
Ordered Substitution Approach
The Appendix shows that Equation 2a for four predictors can also be reparameterized as
Generalized to any number of predictors V, Equation 4b becomes
In Equation 4a–c, the v*th predictor (v* = 1,…, V − 1) is a sum
As with referent substitution, the ordered formulation also has a representation in ANOVA. For four groups let X1
= 1 when Group
Two aspects of Equation 4a–c are of interest. First, for the last three predictors, the regression weight is a contrast between two regression weights in Equation 2a. The t-statistic for testing a regression weight equals zero now tests whether contrasts between adjacent regression weights in Equation 2a are equal. Testing the null hypothesis that the X 2 weight is zero gives a test of the null hypothesis H 0: b 2−b 1= 0, for the first two regression weights in Equation 2a. In our math achievement example, we are interested in whether the predictors form a hierarchy in that the weight for each successive predictor is larger than the previous: b v > b v-1 for all v > 1.
A second important feature of Equation 4c (and Equation 2f) is its submodel that includes only the first predictor in the model:
This submodel is nested within the model of Equation 2a. It is a model where all predictors have the same regression weight:
where
Equation 4b and c gives rise to multicollinearity concerns. Because the predictor sums in these equations contain overlapping sums of variables, those predictor sums may be highly correlated. A simulation study reported in the following gives an indication of how this multicollinearity affects the global hypothesis test and the hypothesis tests for regression weight contrasts.
Resource Allocation Model
For Equation 2a, the regression weight for Xv is the expected change in Y given a one-unit increase in Xv holding all other predictors constant. As a model of behavior, this assumption may be violated in some contexts. Consider a study in which the units of study are organizations, the criterion is a measure of organizational success, and the predictors are budget allocations to various categories: for example, personnel, travel, equipment, and so on. Once the manager of an organization receives the annual budget, the manager can allocate dollars across categories but the total is fixed. The manager cannot allocate an additional dollar to personnel while holding all other expenditures constant, because that would exceed the budget limit by one dollar.
Within the reallocation scenario, there are two variations. First, each manager has the same budget. Here, the predictors are linearly dependent, and the analysis must accommodate dependencies (Davison et al., 2023). In the second scenario, managers’ budgets vary. The referent and ordered approaches model resource reallocations. In the referent model, each regression weight for variable 2 − V is the expected increase in Y obtained by transferring a unit from Variable 1 to Variable v holding all other predictors (and total) constant. The first predictor (the sum) represents the contribution of the budget total for a particular manager and accounts for the variation in Y attributable to variation in budget limits. The ordered equation approach models transfer from Variable v − 1 to Variable v, with the sum in the first predictor accounting for the effect of individual differences in budget total on Y.
The example above involves the allocation of financial predictors, but the allocation need not be financial. The predictors may represent time. A student may spend time in three reading activities: reading group, seat work assignments, and independent reading. The total time may vary by student, but given a particular total the student divides time between the activities. Terms can be added to test hypotheses about interactions: for example, independent reading and total reading time. The researcher may be interested in the multicollinearity of the model as the several sums in the ordered model will be correlated. The researcher may also be interested in the effect on R2 of dropping a predictor. The effect of dropping one predictor from the referent or ordered model would generally be different than the effect of dropping the same predictor from the ordinary model, because other predictors in these models differ from those in the ordinary model.
An allocation model is potentially useful for many psychological/educational concepts (e.g., attention span Bradbury, 2016; McClelland et al., 2013). Attention span is often conceived as a limit on time of attention. Attention can be allocated to different foci. On a math story problem, attention could be allocated to time to read the scenario, time allocated to the accompanying graph, time for the response alternatives, and so on. The predictors could be measured using eye tracking. Other concepts that might be modeled with an allocation framework include working memory (Cowan, 2005; Oberauer et al., 2016) and patience (Schnitker, 2012).
Methods: Simulation Studies
We compared Type I errors and power of the F- and t-statistics utilizing substitution to that of an alternative involving a penalty function, the LM approach.
LM Approach
Since the F-statistic of Equation 7 requires the estimation of the constrained and full model, it may be more convenient to use
The χ2 statistic of the LM test is:
where
When LM is applied to a linear regression with two predictors, the global hypothesis equals the pairwise comparison (i.e.,
The simulation consists of two studies, one to study Type I errors when all regression weights are equal. We studied Type 1 errors both for the global hypothesis
Type I Error Rates
We studied Type I errors for tests of the global and contrast hypotheses for analysis type (substitution vs. LM),
Population Regression Weight Common to the Two and Four Predictors with Each Combination of Predictor Correlation r xx' and Predictor/Criterion Correlation r xy in the Study of Type I Errors
Within each cell, there were 1,000 replications. For each replication, the predictors and criterion were sampled from a multivariate normal distribution MVN (0, Σ). In Σ, all predictor/criterion correlations were equal to a value
Power
The design for the power study was the same as for the Type I error study with two changes. First,
The two dependent variables for this study were power for the global hypothesis
For instance, when there were four predictors, mean
Power Study Two Predictors: Population Regression Weights for Each Combination of Predictor Correlation
For both the Type I and Power portions of the study, we fitted the ordered substitution model and used the tests of null hypotheses for the regression weights in that model to test whether regression weight contrasts in the original model equaled 0. Type I and power from fitting the substitution model were then compared to those for the LM.
Results: Type I Errors
Type I error rates for each condition are in Tables 4 and 5. ANOVA results on the experimental factors for the simulation study are reported selectively for larger effects. For two predictors (Table 4), Type I errors were acceptable for both the global and pairwise hypotheses under all simulation conditions (Bradley, 1978). In the ANOVA, the only statistically significant effect (p < .05) was sample size, and it accounted for little variance as measured by the effect size measure eta squared,
Power Study Four Predictors: Population Regression Weights for Each Combination of Predictor Correlation
Similar results were observed for four predictors (Table 5). Type I errors were acceptable for all conditions (Bradley, 1978). The only significant effect in the ANOVA was sample size, but that was small (
Results: Power
Power for two predictors is in Tables 6 (LM) and 7 (substitution). In the ANOVA, when the regression had two or four predictors, the largest effects were for sample size and
Type I Error Rates for Two predictors With Each Combination of Simulation Conditions a
a The global hypothesis is identical to the pairwise comparison (i.e.,
Type I Error Rates for Four Predictors With Each Combination of Simulation Conditions
Statistical Power for Two predictors With the Lagrange Multiplier (
a The global hypothesis is identical to the pairwise comparison (i.e.,
Statistical Power for Two predictors With the Substitution approach (F) a
aThe global hypothesis is identical to the pairwise comparison (i.e.,
With sample size 500, the global and pairwise tests were adequately powered for all conditions, where
Statistical Power for Four predictors With the Lagrange Multiplier (
a Avg. Pairwise stands for the average power of three pairwise null hypotheses (
Statistical Power for Four Predictors With the Substitution Approach (F)a
aAvg. Pairwise stands for the average power of three pairwise null hypotheses (
The pairwise statistics are less well powered than the global tests, given equal sample size. With a sample size of 40, the pairwise test was generally adequately powered for large effects of
With four predictors, global power was adequate for many conditions. The modest power of the pairwise tests did not always permit identification of the pairs that differed. The LM displayed slightly higher power for small samples. The differences disappear in larger samples. In smaller samples, the LM has slightly higher Type I errors but also slightly higher power. Thus, the LM approach was more prone to reject whether or not the null hypothesis was true for small samples.
High School Coursework and Achievement Example
To illustrate the ordered method, we used data from the High School Longitudinal Study (HSLS-09; Ingels et al., 2015; Ingels et al., 2014). There are 10 predictors corresponding to 10 categories of high school math courses shown in Table 10 (e.g., Algebra 1, Geometry, Algebra 2, etc.). The criterion is high school math achievement score. Our data are a subset of a larger dataset. Our sample size was 2,900, after incorporating a design effect of 4.0 to account for the cluster nature of the sample. Cluster samples take subjects within clusters who tend to be more homogenous, and thus, the number of subjects overestimates the independent information in the sample (Kish & Frankel, 1974). The sample size was reduced by a factor of 4.0 to compensate for dependencies arising from the cluster sampling. Each predictor is measured in the same unit; the number of Carnegie Units taken (CUs) by the student in a coursework area (Ingels et al., 2015). A score of 1 in high school algebra means the student completed 1 CU of Algebra 1. Based on course content and other factors, the National Center for Educational Statistics (NCES) rank ordered the courses, and the ordering is shown from low to high in Column 2 of Table 10. Lee et al. (1998) found that high school math achievement test scores were related to the level of the highest math course reached by the student. For our analysis, we were interested in whether a CU is associated with the same expected increase in math achievement regardless of the course in which it was earned. Specifically, we were interested in whether higher level courses were associated with larger expected achievement scores. Thus, we were interested in whether qualitative differences in course content moderate the effect of one CU on math achievement.
We fit four models to the data. As a baseline, we fit the reduced model of Equation 6 with a single predictor
Results of this second analysis are shown in Table 10. The first column shows the ten course categories ordered by NCES’s level of coursework. The third column shows the empirical estimate of the regression weight and statistical significance. All regression weights are significantly different from zero (p < .05). In what may seem counterintuitive, the regression weight for Algebra 1 is negative. Algebra 1 indicates whether the student took Algebra 1 in high school. Having delayed taking Algebra 1 until high school detracts from a student’s expected level of achievement, hence the negative regression weight for Algebra 1. For this full model, R 2=.396 and the adjusted R 2=.394. As compared to the submodel in which all weights are constrained equal, the full model almost doubles the variance accounted for (.215 vs. .396).
Regression Weights for Models Predicting Math Achievement From Coursework
Note. Adv. Math = other advanced math; AP IB = advanced placement/international baccalaureate course eligible for college credit.
Source. U.S. Department of Education, National Center for Education Statistics, High School Longitudinal Study of 2009 (HSLS:09) Base Year (2009), First Follow-up (2012, 2013), Transcript (2013, 2014).
*p < .05. **p < .01. ***p < .001.
Next, we tested the global hypothesis:
Third, we fit the referent model designating Algebra 1 as the reference variable and all others as focal variables:
This model provides a test that one CU of each focal course yields the same increase in expected math achievement as does one unit of Algebra 1. This model was fit by regressing the achievement test scores onto
Finally, we fitted the ordered reparameterization, the model in Equation 4c:
To fit Equation 13, we regressed the achievement test score onto
The substitution models in Equations 12 and 13 may or may not mirror student’s decision process, and the model need not do so to be of interest. However, the substitution models correspond to a model of the process in which the student first decides how many math courses they will take in high school and then allocates coursework within that constraint. In the substitution models, “holding all other predictors constant” includes holding the constrained total T 1 constant and thereby incorporating the constraint in the decision process. Specifically, here the referent model tests all courses against the standard Algebra 1 and the ordered models test the level of coursework as promoted by NCES.
Overall, the test of the global hypothesis suggests the expected increase in achievement with one CU of math coursework varies as a function of the qualitative course content. All other courses were associated with a higher expected increase than Algebra 1, and the predicted ordering of regression weights was partially supported. Substantively, results must be viewed with caution for two reasons. First, Algebra 1 does not reflect taking Algebra 1 as such, but rather the effect of delaying Algebra 1 until high school. Second, a difference in the weights for two courses may be due to differences in course content or differences in the students who choose to take the different courses.
Conclusions
Methods for estimating regression weight contrasts and their standard errors exist in the literature (e.g., Graybill, 1976). The major contributions of this manuscript stem from the idea of contrasts in ANOVA (a staple) and the connection to predictors in regression. As in ANOVA, there are infinitely many possible contrasts. Also, as with ANOVA, some are more theoretically relevant. We list two such models; the referent and ordered models in which contrasts between regression weights in the original linear (or generalized linear) model are the regression weights of the reparameterized model. These models contribute in several ways. First, they show the relation between regression and corresponding referent and ordered models in ANOVA. Second, they enable the study of the impact on a criterion of qualitative differences for predictors. Third, the reparameterized models provide a tool for studying how the allocation of points from one predictor to another affects the expected value of Y holding other predictors and the total of the predictors for each person constant. The substitution models provide the estimates of contrasts, their standard errors, and tests of hypotheses about the contrasts. These models also provide a way to study moderating effects of respondent variables (e.g., sex), interactions of predictors, multicollinearity of predictors, and effects of adding/subtracting predictors. The effect of adding or subtracting a predictor from the reparameterized model generally will not be the same as that for adding or subtracting the same variable to the ordinary linear model. Also, multicollinearity of a variable will be different in the reparameterized model.
As a basis for testing hypotheses about contrasts, our approach differs from some in the literature. Consider the hypothesis
Models discussed here are particularly useful for predictors measured in common units. Time is one such example. If a researcher is studying several activities, they can measure time spent in each. Regressing the criterion on the activity time predictors, a researcher can study whether the expected increase in the criterion is the same for all activities. In our example, we studied time spent in math education activities and its relationship to math achievement. Another example is cost. If a researcher is studying types of expenditures, the researcher can measure the amount spent on each expenditure. By regressing an outcome variable on the expenditure predictors, the researcher can study whether the expected increase in the outcome variable is the same for a dollar increase for each type of expenditure. A third useful unit is dosage.
Of special note, the analyses discussed here can be useful in analyzing data from an experiment or quasi-experiment. Treatments often involve multiple activities. In a study of a treatment containing three activities, the researcher can measure the amount of time spent by each respondent in each activity: for example, individual therapy, group therapy, and journaling (or the cost of each activity). By regressing the criterion on the activity predictors, one can evaluate whether a unit increase in an activity is associated with the same expected increase in outcome irrespective of activity type. The analyses can also be useful in broken randomized field trials in which respondents are randomly assigned to treatments, but some spend more time in their assigned treatment. Or some respondents not only experience their assigned treatment, but also one or more alternative treatments. These analyses give a way to analyze the experience of treatment rather than the intent to treat. In studies with a control group, the control treatment predictor can be the reference predictor in our referent model. In ANOVA, treatment is usually conceptualized as a discrete, all-or-none event, whereas in our models, a treatment can be conceptualized as an event that occurs in some amount varying over respondents and is represented in our model as a continuous predictor variable. Thus, the intensity of the treatment can also be taken into account.
In all simulation scenarios in this study, Type I error rates were adequately controlled by both the F and LM statistics. This was true for both the pairwise and global test. These results do not preclude the possibility of quite different results in other scenarios, such as that studied by Kiviet (1986) who studied time-series data to make decisions about model misspecification rather than regression weight contrasts. Our situation involves only a single time point and decisions about the equality of two or more regression coefficients.
Power was a complex function of sample size and
There were two surprising results involving multicollinearity. Larger values of
We extended the work of Carlton-Ford (1993) to V predictors. Note that our models can include regular predictors as well as contrasts. With more than two predictors, the substitution approach can have infinite manifestations, two of which were given here; the referent and ordered reparameterization. The global F-test for these models is well-known, but our new models lead to simple t tests of differences between pairs of regression coefficients in the original regression model, tests that will be of most interest for variables measured in common units. An alternative to our approach is the LM approach based on a penalty function. Rationale for the Lagrange statistic is weaker than that for the F- or t-statistic in small samples, since the former is based on asymptotic, large sample theory. Still, in our simulation study, the Lagrange performance was not markedly different from that of the F- and t-statistics.
When software for the implementation of a penalty function is readily available, the Lagrange approach can be extended to null hypotheses beyond our substitution approach by simple matrix specification of the equality restriction expressed in the null hypothesis. Substitution has the advantage that it can readily be extended beyond the ordinary least squares linear model. It can also be implemented in ridge, maximum likelihood, probit, logit, and multinomial regression. It can be implemented in virtually any software package for linear, hierarchical linear, and/or generalized linear regression. Our simulation suggests the Lagrange and substitution approaches have similar Type I error and power. Choice between the two may be a function of other factors: available software, type of regression model to be fitted (e.g., linear vs. probit), optimization function (e.g., least squares vs. ridge regression), and the form of the pairwise hypotheses.
In the ordinary regression model, regression weights are interpreted as effects of adding a unit to a predictor. In the reparametrized model, the weights correspond to contrasts between regression weights in the original model, and the contrasts are interpreted as the effect of transferring a point from one predictor to another. Our models enable the researcher to go beyond simple estimation and hypothesis testing of contrasts. The models allow one to study the effect of interactions between predictor variables in the contrast model, moderator effects of respondent variables (e.g., sex) on contrasts, multicollinearity of predictors, and the effects on R 2 of adding or deleting variables. Moreover, none of this requires specialized software.
Appendix
Ordered Substitution Approach
Here, we prove that Equation 4a and b is the reparameterization of the OLS model of Equation 2a. Let t be a subscript designating a new parametrization (t = 1,…., V), where V is the number of original predictors. Each iteration results in a new parameterization of the linear regression equation, parameterization t. Let the ordinary linear regression equation (2a) be defined as parameterization t = 1.
In each iteration t, the same weighted sum of predictors is both added to and subtracted from parameterization t, which after some rearrangement of terms and a little algebra yields the outcome parameterization (t + 1). The sum that is added to and subtracted from parameterization t to obtain the parameterization (t + 1) is of the form
where
For four predictors, the linear regression prediction equation is as follows:
where Xv
is predictor v, and
If we add and subtract this sum to parameterization 1 (Equation A2), we get
If we define Tv
as the sum of predictor variables v to V, then the first summation on the right of Equation A3d is
Equation A3e or A3f is the outcome parameterization t = 2, outcome for the first iteration.
If we add and subtract this sum to parameterization 2 (Equation A3f), we get
Substituting
Equation A4e is parameterization t = 3. We are now ready for the last iteration.
Adding and subtracting this quantity from parameterization 3 (Equation A4d) yields
Substituting
Equation A5c is parameterization t = 4 and the final one. It is a regression equation in which three of the predictors are sums (
Equation A5c gives the final parameterization for the special case of four predictors. For any number of predictor variables V, the general form for the final reparameterization is
If the predictors (
Supplemental Material
Supplemental Material, sj-docx-1-jeb-10.3102_10769986231200155 - Pairwise Regression Weight Contrasts: Models for Allocating Psychological Resources
Supplemental Material, sj-docx-1-jeb-10.3102_10769986231200155 for Pairwise Regression Weight Contrasts: Models for Allocating Psychological Resources by Mark L. Davison, Hao Jia and Ernest C. Davenport in Journal of Educational and Behavioral Statistics
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
