Abstract
The relationship between processes and time-varying covariates is of central theoretical interest in addressing many social science research questions. On the one hand, event history analysis (EHA) has been the chosen method to study these kinds of relationships when the outcomes can be meaningfully specified as simple instantaneous events or transitions. On the other hand, sequence analysis (SA) has made increasing inroads into the social sciences to analyze trajectories as holistic “process outcomes.” We propose an original combination of these two approaches called the sequence analysis multistate model (SAMM) procedure. The SAMM procedure allows the study of the relationship between time-varying covariates and trajectories of categorical states specified as process outcomes that unfold over time. The SAMM is a stepwise procedure: (1) SA-related methods are used to identify ideal-typical patterns of changes within trajectories obtained by considering the sequence of states over a predefined time span; (2) multistate event history models are estimated to study the probability of transitioning from a specific state to such ideal-typical patterns. The added value of the SAMM procedure is illustrated through an example from life-course sociology on how (1) time-varying family status is associated with women’s employment trajectories in East and West Germany and (2) how German reunification affected these trajectories in the two subsocieties.
Keywords
1. Introduction
Many theoretical questions in the social sciences address the relationship between longitudinal processes and time-varying covariates. Life-course and career researchers are interested in how changes in one life domain can influence trajectories in another (e.g., family and employment [Aisenbrey, Evertsson, and Grunow 2009]) and how changing economic conditions or family policies shape the transition to adulthood (Shanahan 2000). Social policy analysis is concerned with policy development processes that can be altered by specific events, such as wars or a change of government (Abbott 1995; Frank, Hironaka, and Schofer 2000). Related research questions are at the center of historical comparative sociology. Similarly, social movement scholars study how social movements unfold over time in response to trigger events (Minkoff 1995; Olzak 1989). Likewise, organizational ecology research examines how organizations develop over time: The relationship between processes and time-varying factors—for instance, the introduction of new technologies—is of core theoretical interest in this field as well (Carroll et al. 1993). These selected examples could be easily extended to other subfields of the social sciences. While the core units of analysis are located either on the micro, meso, or macro level, the similarity among all these examples is that they are not interested only in processes of metric outcomes, such as income or IQ. Instead, they focus on processes that consist of categorical states, including family trajectories, the implementation of specific policy programs, or the stages of group behavior prior to the outbreak of violent protest.
Two broad families of methodological strategies have been used to study the relationship between time-varying covariates and outcomes and trajectories of categorical states. The first strategy focuses on the occurrence of events or transitions (Allison 1984; Therneau and Grambsch 2000; Yamaguchi 1991) and relies on event history analysis (EHA) to estimate the effect of time-varying covariates on the risk of observing an event. However, the use of EHA is limited to modeling instantaneous changes, and it loses sight of the trajectory as a whole (Billari 2005). The second strategy emphasizes the holistic nature of trajectories or processes of categorical states by relying on sequence analysis (SA) (Abbott 1995; Studer and Ritschard 2016). While SA considers change and multiple transitions as lasting over longer time spans, in its traditional framework, studying the relationship between time-varying covariates and trajectories is impossible.
We propose an original combination of these two approaches, in what we call the sequence analysis multistate model (SAMM) procedure. The SAMM procedure is a stepwise application of (1) SA-related methods to identify ideal-typical trajectories understood as the sequence of states experienced by each individual during a given time span and (2) multistate event history models to study the probability of transitioning from a state to such ideal-typical trajectories. The SAMM procedure offers several advantages for studying processes. First, it allows for modeling the relationship between time-varying covariates and patterns of change within processes that unfold over long periods of time. This closely corresponds to the theoretical concept of trajectories as “process outcomes” (Abbott 2005). Second, studying trajectories holistically allows us to unveil potential interdependencies between states and transitions within trajectories. The social meaning of a given situation often depends on both previous and later events, which may be known in advance by the actors involved. For instance, an individual may start a new job although (or because) he or she knows that it will be only temporary. Finally, the SAMM procedure can handle censored observations, which is possible only to a very limited extent in the traditional SA framework, and this allows for inclusion in the analysis of trajectories that are only partially observed.
We demonstrate the added value of the SAMM procedure by using an original illustrative application in life-course sociology. Two theoretical principles in the life-course paradigm assume individual life-courses to be multidimensional (e.g., family and employment) and shaped by macrostructural and historical changes (Elder, Kirkpatrick Johnson, and Crosnoe 2003). We employ the historically unique “social experiment” of German reunification to exemplify how the SAMM procedure can contribute to a better understanding of these two core life-course principles. Specifically, we assess how (1) time-varying statuses in the family domain are associated with women’s employment trajectories in East and West Germany (multidimensionality of life-courses) and (2) how the German reunification affected women’s employment trajectories in the two subsocieties (i.e., the impact of macrostructural change). Beyond previous research on life-courses during the German reunification (Bonin and Euwals 2002; Hauschild 2002; Klammer and Tillmann 2001; Mayer 2006; Trischler and Kistler 2010), our application uses data from more recent birth cohorts. This allows us to track differences and similarities in women’s employment trajectories in East and West Germany not only in the immediate transition period but also up to 20 years following reunification.
The remaining sections of this paper discuss the following topics. Section 2 briefly introduces background information on the German reunification. Section 3 reviews the complementary strengths and weaknesses of SA and EHA. Section 4 presents the SAMM procedure. In Section 5, we apply the procedure to our illustrative example and further provide robustness checks. We compare results from the SAMM procedure with standard MMs.
2. Illustrative Example: The German Reunification
Between 1955 and 1990, Germany was divided into the Federal Republic of Germany (FRG) in the West and the German Democratic Republic (GDR) in the East. The two subsocieties differed greatly in terms of both the ideational and institutional characteristics of their labor markets and welfare systems.
The GDR promoted a universal-breadwinner model within a communist egalitarian stratification system and aimed at achieving population growth through pronatalist family policies. The constitution guaranteed the “right and duty to work” (Kreyenfeld 2004). Women’s labor market participation rates reached 90 percent (Huinink et al. 1995). Almost-universal and day-long child care facilitated labor market participation among mothers (Huinink et al. 1995).
In the FRG, the male-breadwinner model was the core organizing principle of social policies (Brückner 2004). Tax-splitting among spouses reinforced incentives for marriage and a male-breadwinner/female-homemaker specialization (Fasang 2014). The infrastructure for public child care was limited, particularly for children under three years of age. In addition, the FRG was characterized by a normative climate in which mothers’ employment was regarded as harmful to small children (Treas and Widmer 2000). During the decades before reunification, the female labor market participation rate in the FRG was only around 50 percent.
With reunification, the Eastern federal states adopted Western labor market institutions and social policies. The dramatic changes in the occupational structure and the destruction of about one-third of the jobs in the East led to persistent disadvantages in the economy of the former GDR (Goldstein and Kreyenfeld 2011; Kreyenfeld 2003). In the first years following reunification, the rapid privatization of the economy was counterbalanced by huge shares of subsidized jobs in the public sector, early retirement schemes, and generous social security transfers (Franz and Steiner 2000). However, after this short period, lower wages and higher unemployment rates continued to characterize the East compared with the West long after reunification (Goldstein and Kreyenfeld 2011). This challenged the initial expectations of progressive convergence toward a Western standard, following a critical adaptation period.
Most of the previous studies on the effects of reunification on labor market participation have focused on the West (e.g., Aisenbrey et al. 2009; Biemann, Fasang, and Grunow 2011; Brückner and Mayer 2005; Gundert and Mayer 2012; Manzoni, Härkönen, and Mayer 2014; Mayer, Grunow, and Nitsche 2010), examined differences before or after reunification in one of the two subsocieties (e.g., Diewald and Mach 2006; Diewald, Solga, and Goedicke 2006; Solga and Diewald 2001), or compared the East with the West only before or only after reunification (e.g., Diewald 2006; Diewald et al. 2006). A few studies made comparisons across contexts and periods simultaneously but relied on data for relatively old cohorts (i.e., up to 1970) (Mayer, Diewald, and Solga 1999; Rosenfeld, Trappe, and Gornick 2004; Simonson et al. 2011). Therefore, we know little about the labor market experiences among younger cohorts following reunification. This research gap originates both in a lack of appropriate methodology for studying the impact of macrostructural change on longitudinal life-course and the lack of data covering a sufficiently long time span.
In this context, we address two recurrent core research questions in life-course research: (1) how different life domains—namely, employment and family—are interrelated (Elder 1974; Elder et al. 2003) and (2) how macrostructural changes—such as the German reunification—shape individual life-course trajectories. In both cases, examining the relationships between time-varying covariates and trajectories is crucial.
Our analysis is based on the retrospective data of the Starting Cohort Six of the National Educational Panel Study (NEPS) (Blossfeld, Rossbach, and von Maurice 2011) for women born between 1944 and 1990 in East and West Germany (N = 731 and N = 3,406, respectively). We constructed individual employment trajectories from age 15 years to the maximum age of 40 years by coding each month according to one of three states: in education, employed, or out of employment (OE). For simplicity, in this illustrative application of the SAMM procedure, we did not distinguish between different types of education or different reasons for being OE, which, however, would be technically feasible.
3. Existing Methodological Approaches
We now present sequence analysis and event history analysis and highlight their complementary strengths and limitations from both methodological and conceptual viewpoints.
3.1. Sequence Analysis
Sequence analysis (SA) provides a holistic view of processes described as sequences—that is, successions of categorical states (Abbott 1995). From a more technical viewpoint, SA relies on a distance measure between sequences (or trajectories) of states, which allows for their comparison (Abbott and Forrest 1986; Elzinga 2005; Müller et al. 2008). Several distance measures are available, and choosing one should be based on their sensitivity in accounting for differences in timing, duration, or sequencing (for a review of distance measures, see Aisenbrey and Fasang 2010; Robette and Bry 2012; Studer and Ritschard 2016). The distances can be further analyzed by using discrepancy analysis (Struffolino, Studer, and Fasang 2016; Studer et al. 2011), multidimensional scaling (Piccarreta and Lior 2010), and cluster analysis to group similar trajectories (for a review of the available clustering algorithms in SA, see Studer 2013). The outcome of this last procedure is a typology. The types are then interpreted as describing the main ideal-typical processes or trajectories. The remaining variation in the sequences within each type is usually ignored, assuming that a description of the social world requires a certain degree of simplification and that deviations reflect different realizations of the same underlying process (Studer 2013).
Short-term changes 1 and long-term dynamics are simultaneously considered. These features are in line with the life-course paradigm, which stresses the importance of studying the unfolding of trajectories understood as sequences of roles and social statuses (Elder et al. 2003). Therefore, SA is one of the most promising methods to study life-course research questions (e.g., Brzinsky-Fay 2007, 2010, 2014; Liefbroer and Toulemon 2010; Mayer 2009; Shanahan 2000).
Within the SA framework, change is operationalized as lasting over a period of time rather than as instantaneous. As Shanahan (2000) points out, important transitions can occur over several months or years and are usually less well defined than the study of a single event would suggest. He argues that important transitions may result from a succession of events. Abbott (2009) puts forward similar arguments when discussing the notion of “turning points” within processes. Finally, Brzinsky-Fay (2014:218) states that “measuring transitions means capturing a process with a specific time dimension.”
Brzinsky-Fay (2007) advocates for considering longer periods of time when studying employment trajectories as labor market integration or exclusion are processes that last over extended periods (see also Brzinsky-Fay 2010). Moreover, the labor market entry process might have become more complex in increasingly volatile labor markets with numerous internships, temporary jobs, and unemployment spells (Brzinsky-Fay 2007).
Sequence analysis additionally allows us to consider interdependencies between states in a trajectory in terms of the duration, timing, and sequencing of states (Abbott and Forrest 1986; Studer and Ritschard 2016). These dynamics illuminate the internal logic of trajectories by highlighting the necessary or avoidable steps, important turning points, and typical pattern of changes. The sequencing situates single events within longer trajectories. This is important because the social meaning and/or consequences of an event may depend on both previous and later events (Elder et al. 2003). For instance, Brzinsky-Fay (2010) points out that the meaning of part-time employment depends on both previous and later states in employment trajectories. Part-time employment can signify either an employment entry or an exit process. Similarly, the social meaning of unemployment, for example, depends on its duration and timing with regard to other employment or nonemployment states (Brzinsky-Fay 2014). The same applies to other life domains—for instance, the duration between marriage and first childbirth.
However, SA also has several limitations. First, since trajectories are analyzed as a whole, when treating them as dependent variables, we can examine their relationships only with constant attributes or covariates measured before the starting point of the trajectory. Including covariates measured later—such as those in the middle or at the end of the trajectories—leads to conceptual issues related to the well-known problem of anticipatory analysis (Hoem and Kreyenfeld 2006). Indeed, disturbing the temporal order of events leads to accounting for the trajectories, or at least part of them, by a future measurement. The inclusion of censored observations is problematic even when using normalized distances or by adding a missing value state at the end of the sequences. Indeed, after clustering, the resulting typology is often based on the length of the sequences (i.e., observation time), and it rarely has substantive meaning (Elzinga and Studer forthcoming). Because the processes being compared by the distance measure need to be fully observed, most of the studies analyze only complete trajectories to preserve the holistic perspective. Therefore, the sample size is reduced and one usually excludes the observations for younger cohorts of individuals because their trajectories are only partially observed.
To analyze changes in the employment trajectories during German reunification, a conventional application of SA would select a set of fully observed trajectories and build a typology of the sequences. The link with the reunification would then be analyzed by looking at the frequency of each type of sequence in each birth cohort. However, this would offer a very crude estimation because most trajectories would include time points both before and after reunification. Moreover, we would exclude censored observations (i.e., from women born after 1970 whose employment trajectories are not observable from ages 15 to 40). Therefore, none of the trajectories would have started after reunification.
3.2. Multistate Models
Event history analysis (EHA) is another framework widely used to study processes and transitions between states. It includes a number of methods for estimating duration between two events—such as starting and ending an employment spell—or in a more or less similar way, the hazard of experiencing the second event after the first one.
Within this framework, multistate models (MMs) analyze state sequences by focusing on the hazard rate of observing transitions between states and, implicitly, the time spent in each state (Andersen and Keiding 2002; de Wreede, Fiocco, and Putter 2011; Putter, Fiocco, and Geskus 2007; Steele, Goldstein, and Browne 2004; Therneau and Grambsch 2000). Figure 1(a) visualizes the MM for our illustrative application and shows all possible transitions between the states used to define individual sequences: education, employment, and OE. 2

(a) The multistate model and (b) estimation strategy.
MMs estimate the risk of experiencing a transition
MMs and the EHA framework have several advantages. First, MMs allow for the simultaneous analysis of transitions between several states and the time spent in each spell, which are crucial features of the dynamics of trajectories. Second, censored observations (i.e., individuals whose trajectories are not fully observed) can be included in the analysis. Finally, they allow us to measure the influence of possibly time-varying explanatory factors on the occurrence of a given event (Allison 1984; Blossfeld and Rohwer 2002; Courgeau and Lelièvre 1993; Hosmer and Lemeshow 1999; Yamaguchi 1991).
Some limitations of MMs have to be acknowledged. MMs conceive transitions as instantaneous—that is, they occur at one specific time point. As stated earlier, conceptualizing and analyzing changes and transitions over longer time spans is important. Moreover, by focusing on transitions rather than on the longitudinal sequencing of states, MMs fail to take a global view of the unfolding of trajectories and the interdependencies among states over time.
By using an MM for our illustrative application, we would be able to estimate how the transitions between two states is correlated with German reunification. Furthermore, the approach would allow us to include censored observations in the analysis. However, we would lose sight of the whole trajectory and not make a distinction, for instance, between working summer breaks and transitioning from education to stable employment, which is a crucial substantive difference.
4. Sequence Analysis Multistate-Model Procedure
So far, none of the available approaches fully addresses the methodological challenge of estimating the effect of time-varying covariates on trajectories. We propose the sequence analysis multistate-model (SAMM) procedure, which combines these two approaches in a stepwise analytical strategy. The SAMM procedure preserves the advantages of both approaches: It conceives change as lasting over a medium-term period while allowing us to study how time-varying explanatory covariates shape trajectories.
The SAMM procedure consists of two steps. First, we use an adapted form of SA to study the typical sequencing and the duration between the main events marking the trajectories over a medium-term period. Then, we use an MM to estimate the chances (or risks) of following each kind of typical sequence, as identified in the first step. We detail each of these steps on a general level, highlighting the necessary choices and available options. The illustrative application of the method follows in the next section.
A script distributed as a supplemental online appendix provides a step-by-step guide on how to implement the SAMM procedure in R.
4.1. Step 1: Identifying Typical Subsequences of Change
In the first step of the SAMM procedure, we identify subsequences that describe what follows over the specified period after each transition between two states. We then build a typology of these subsequences to identify ideal-typical sequences of changes along the trajectories.
4.1.1. Extracting Subsequences
To extract subsequences of consecutive states, we first set the time span, denoted by
We extract only the subsequences following a transition that occurred before
Figure 2 provides three examples of the extraction procedure. Sequence 1 represents a woman’s employment trajectory that starts with an education spell. After 46 months, she experiences a transition from education to employment. In this example, we set

Three examples of the subsequence-extraction procedure. Extracted subsequences are marked with solid or dashed rectangular borders. The censored time is represented by the vertical black bar.
If
There are several substantive reasons for studying subsequences. First, the subsequences following a transition may be known (at least to some extent) by the actors. For instance, students may plan a short-term job during the summer break before restarting their studies in autumn. The same applies if a woman accepts a new job even if (or because) she knows that it is only short term. These subsequences may therefore convey meaningful information even from the viewpoint of the actors.
Second, the subsequence that follows a prior transition may be a consequence thereof. That is, the first transition may be the stepping-stone for a more profound transition lasting over a certain period. As we already argued in Section 3.1, transitions are not necessarily instantaneous and may last for a period of time (Abbott 2009). 4 In breaking down a change into single transitions, we may fail to describe the underlying dynamics of the trajectory (Shanahan 2000). By using subsequences, we implicitly consider the dynamics of change over a medium-term period—something that is more substantively meaningful in many applications.
4.1.2. Subsequence Clustering
Once all subsequences have been extracted, we cluster them by using SA to build a typology. The obtained ideal-typical subsequences summarized in clusters can be interpreted as typical changes along the trajectories that follow a transition between states. This step requires choosing a distance measure for comparing the sequences and a clustering algorithm.
According to Studer and Ritschard (2016), the choice of a distance measure is a substantive one, which should be based on the performance of a distance in accounting for three dimensions: timing, duration (or spacing), and sequencing. Note that analyzing time since first transition is different from analyzing time understood as age, for instance. So if there is a substantive reason to focus more on timing or sequencing, the distance measure should reflect this interest. 5
The standard sequence clustering procedure (Studer 2013) needs to be adjusted to generate meaningful distinctions for trajectories following a given spell. In the following MM analysis, we analyze the risk of following each type of subsequences, departing from a given state. A separate cluster analysis is conducted for each pool of subsequences that start with the same state (i.e., one of education, employment, or OE for our example application). In all studies that use a SA typology for subsequent analysis (either as an independent or dependent variable), the cluster quality should be sufficiently high to guarantee high within-cluster similarity (Studer 2013). Furthermore, we should ensure that within cluster heterogeneity is not linked to key factors such as age at the start of the subsequence. In our specific case, we can expect a higher cluster quality since the subsequences are shorter than the whole sequence and probably show lower variation. For this reason, they are probably less complex to cluster.
4.2. Step 2: Estimating the Relationships between Trajectories and Time-varying Covariates
After identifying the typical subsequence of changes, we use MMs to estimate the relationships between covariates and the clusters of typical subsequences. Using this approach, we assess how the covariates are associated with the hazard rate of following each type of subsequence cluster while departing from a given state. Individuals with censored observations and those who did not experience any subsequences (i.e., those in a stable sequence without any transition during the entire observation period) are included in the analysis in the at-risk population.
The competing risks are not transitions between states but rather transitions from one state to a sequence of states over a medium-term period (here five-year subsequences). To this purpose, MMs can be adapted to study competing risks, when the different states can be interpreted as different starting conditions for the competing events (Steele et al. 2004).
4.2.1. Estimation Method
Any estimation method suitable for competing risks (see e.g., Andersen and Keiding 2002) can be used for MMs (for more information, see the excellent tutorial by Putter et al. 2007). We propose to consider two factors when choosing among these methods: the underlying measurement of time and the possible recurrence of spells for the same individual trajectory. First, if the true durations to be estimated are continuous, we then recommend using Cox models, which are relatively simple and widely available. When the time dimension is measured on a discrete time scale, the “Efron” method should be used to appropriately handle ties in durations (for a review of ties handling methods, see Hertz-Picciotto and Rockhill 1997). On the other side, when the durations to be estimated are discrete or when the durations are only measured with a very crude approximation such as years, then logistic or multinomial discrete-time models should be preferred (Allison 1982).
Second, if several spells in the same state can be observed in the same sequence, then a model including a frailty term or a random intercept should be preferred. These models relax the assumption of independence of spells belonging to the same individual (Blossfeld and Hamerle 1990; Galler and Poetter 1990; Mayer and Tuma 1990; Wu 2003). Models with frailty terms also provide more accurate estimates of individual-level covariates (Bijwaard 2014) and consider unobserved heterogeneity. 6 One can use a Cox proportional hazard model with frailties or random intercept (Lunn and McNeil 1995; Putter et al. 2007; Therneau and Grambsch 2000) or a multinomial or logistic discrete-time models with random intercept (Steele et al. 2004). These models take into account multiple observations (in this case subsequences) nested within individuals. On the other side, if each spell can occur only once in each sequence, then the usual Cox model or logistic model can be used.
When using a Cox model or a logistic discrete-time model (with or without a random intercept), we usually estimate a separate model for each competing risk (i.e., typical subsequences of changes here). In this setting, we estimate the chances to experience a specific typical subsequence of changes instead of (1) any other or (2) remaining in the spell. This is achieved by considering a new data set where the competing risks are recoded as censored observations. This strategy assumes that the effect of covariates is transition-dependent and cause-specific hazard rates are nonproportional (with a separate risk curve being used for each transition).
5. Illustrative Application of the SAMM Procedure
We now present an application of the SAMM procedure to assess (1) how time-varying statuses in the family domain are associated with women’s employment trajectories in East and West Germany and (2) the association between the reunification and women’s employment trajectories.
5.1. Step 1: Subsequence Extraction and Clustering
For the first step of the SAMM procedure, we extracted subsequences from the trajectories and clustered them. Here, we chose a subsequence length
We then clustered the extracted subsequences by using SA. We chose the optimal matching distance measure with constant costs, 7 which is sensitive to duration—a key aspect of employment trajectories—while still being sensitive to timing and sequencing (Studer and Ritschard 2016). We used partitioning around the medoid clustering method as it has the advantage of minimizing a global criterion measuring residual variation (Kaufman and Rousseeuw 1990). The number of clusters was chosen to maximize the average silhouette width (ASW). We obtained good cluster quality according to the thresholds given in Kaufman and Rousseeuw (1990), with three groups for each pool of subsequences starting with education (ASW = .65), OE (ASW = .64), or employment (ASW = .62). The relative frequency plots presented in Figure A1 of online Appendix A further confirm this high within-cluster homogeneity. We also made sure that the within-cluster heterogeneity is not linked to age using index plots as presented in Figure A3 of online Appendix A. Even if we keep three clusters after each ending spell, the choice of the same number of groups for each set of subsequences is not compulsory. For example, we could very well have chosen only one type of transition for leaving an education spell (regardless of the destination) and three for the others depending on the ASW values.
Figure 3 shows state distribution plots of the subsequence clusters for each starting state. The clusters are labeled according to the medoid (i.e., the subsequence with the lowest average distance to all others in the cluster). The percentages in Figures 3(a), 3(b), and 3(c) refer to the share of extracted subsequences starting in education, out of employment, and employment, respectively, and not to individual sequences. For instance, the percentages in Figure 3(a) are calculated as the number of subsequences in each cluster divided by the number of subsequences that start with education.

Clusters of subsequences: distribution of the states (
Figure 3(a) presents the clusters related to education. First, for 10 percent of the subsequences, education is followed by being OE either immediately or within the next five years (left panel). Second, almost 25 percent of the transitions out of education are only temporary, with a quick return usually after two months (middle panel). The brief interruptions in continuing education consist of phases both in and out of employment. Finally, almost two-thirds of subsequences that start with education are swiftly followed by employment that lasts for the next five years (right panel). Some women in this group experience a short OE spell before starting employment.
Figure 3(b) presents the clusters following a move away from OE. First, in one-third of the subsequences, women return to education and mostly continue into employment in the next five years (left panel). This pattern likely follows the two-month break of education identified previously. Second, the majority of OE spells end with employment uptake (54 percent, middle panel). Finally, 13 percent of the subsequences show high employment volatility with only brief temporary employment spells before returning to OE.
The clusters after leaving employment are shown in Figure 3(c). First, some women return to education (around 16 percent of the subsequences, left panel). Additional visualization with sequence index plots (available on request from the authors) showed that recurrent transitions between employment and education are common in this group. Second, almost half of the subsequences (47 percent) are shown to be OE over the next five years (middle panel). Finally, 37 percent return to employment after a short period of OE, often within less than one year.
The subsequence clusters identified trajectories of back-and-forth dynamics, such as “Edu–OE–Edu,”“OE–Empl–OE,” or “Empl–OE–Empl.” These kinds of dynamics enable us to distinguish between short summer jobs and more stable transitions into employment and highlight the volatility of employment trajectories with recurrent moves of “OE–Empl–OE” or “Empl–OE–Empl.” In addition, the clusters summarize not only instantaneous transitions but also “medium-term transitions” between states within a five-year period. In contrast to direct instantaneous transitions (i.e., between two states, without an intermediate state in between), we define medium-term transitions as subsequences in which the transition to another state occurs after a longer period of time. The clusters of “Edu–Empl,”“Edu–OE,”“OE–Empl,” and “Empl–OE” exemplify medium-term transitions. Table 1 highlights the difference between instantaneous and medium-term transitions in a contingency table. More than 50 percent of the direct transitions “Edu–OE” in fact are part of a subsequence of “Edu–OE–Edu–Empl.” Therefore, direct “Edu–OE” transitions are difficult to interpret out of the context of the longer-term trajectory. The SAMM procedure allows us to clearly identify these medium-term transitions and distinguish them from faster back-and-forth movements, which is not possible in the conventional MM.
Percentage of Direct Transitions Classified in Each Subsequence Cluster
Note: EDU = education; EMPL = employed; OE = out of employment.
Source: NEPS data.
5.2. Step 2: Multistate Models
In the second step of the SAMM procedure, we estimate the effect of explanatory factors on the chances to follow each type of subsequences using an MM. Figure 4 presents the MM for our example: Each arrow is a hazard rate to be estimated. We therefore need to estimate nine hazard functions (i.e., one for each pair of ending spell and typical subsequence cluster).

Estimation strategy for the multistate model combined with sequence analysis.
The underlying durations to be estimated here are continuous, and we might observe several spells in the same state for each individual. We therefore used a Cox model with random effects to estimate the MM. The following covariates were included in the MM.
The relationship between women’s employment trajectories and family status is estimated by including two covariates: being in a union and having at least one child. 8 Family status is expected to be related to transitions in and out of employment. A covariate for East and West and two separate dummies for reunification in the East and West were included as proxies of macrolevel change.
We additionally controlled for the period, measured in months and coded as a continuous variable. In fact, several developments besides reunification—such as cultural change and educational expansion—might have affected employment trajectories within our observational window. By omitting the period covariate, we would have attributed all changes in employment trajectories to the reunification of the two German subsocieties. 9
We also included several control variables. First, age was added as a three-degree polynomial transformation to ensure that the other effects are not related to age. 10 Second, we included parental education, measured as the highest number of years of education between the parents. Finally, we introduced dummies for calendar months because education-related transitions (starting or ending) typically occur between June and September and to control for yearly economic cycles (e.g., those related to unemployment). Moreover, given the retrospective nature of our data, recall bias could result in a higher number of transitions during some key months, such as trimesters, semesters, or the beginning of the year. To facilitate interpretation, we standardized all continuous covariates; their coefficients are therefore unit-free.
5.2.1. Interpretation
Table 2 shows the SAMM procedure’s estimates for women’s employment trajectories in East and West Germany. The estimates refer to the association between (time-varying) covariates and the likelihood of ending a spell in one of the ideal-typical subsequence clusters. Overall findings both on the association between family life-courses and the German reunification on women’s employment trajectories are largely in line with expectations.
Sequence Analysis Multistate Model (SAMM) Procedure for Women’s Employment Trajectories in East and West Germany
Note: EDU = education; EMPL = employed; OE = out of employment; AIC = Akaike Information Criterion.
Source: NEPS data.
p < .10. *p < .05. **p < .01. ***p < .001.
Having a child reduces a woman’s likelihood to enter any of the ideal-typical subsequences following OE. Women therefore tend to remain out of employment longer after having a child. In addition, they have a higher likelihood to exit from either education or employment into long-term OE after childbirth. Being in a cohabiting or married union is associated with shorter education spells for women that either lead to employment or withdrawal from the labor market. Whether employment uptake or withdrawal occurs after entering a union likely depends on the partner’s employment status and the couples’ gender-specific division of labor.
Concerning our first research question on the impact of family life-courses, the SAMM procedure uniquely allowed us to distinguish the differential impact of having a partner on women’s periods out of employment: Although women in a union are more likely to experience long-term periods out of employment, short-term transitory periods out of employment within overall more dynamic trajectories are more frequent for women who are not in a union.
Concerning reunification, we find stronger effects for East German women’s employment than for their West German counterparts. This is in line with the profound shift from the communist GDR to the capitalist FRG model in the East while the institutional context in the West remained similar. Findings support both convergence and divergence of women’s employment trajectories after the reunification. 11
On the one hand, women’s employment trajectories in both subsocieties converged on a higher likelihood of extended periods out of the labor force after the reunification, albeit the coefficients are mostly insignificant in the West. Note that periods out of employment cover both unemployment and family-related leaves in our analysis with arguably more unemployment in the East and more family-related interruptions in the West.
On the other hand, we also see persistent differences after reunification with a higher likelihood of extended periods out employment for West German women compared with East German women. This is expressed both in East German women’s higher likelihood of entering the straightforward route to employment in the “OE–Empl” (column 6) group and their lower likelihood of exiting employment via the “Empl–OE” (column 9) group compared with the West. Importantly, East German women’s employment trajectories have diverged from their West German counterparts with higher employment volatility. For East German women, we find an elevated likelihood of transitioning into frequent back-and-forth movements between employment and nonemployment after reunification, which is not the case in the West. This is visible in the positive reunification effects for the East (East: Reunif) on entering the “OE–Empl–OE” (column 7) and the “Empl–OE–Empl” (column 10) clusters that are insignificant with a negative sign for the West (West: Reunif). Higher employment volatility for East German women after reunification is further substantiated with their lower likelihood to enter more stable employment trajectories, such as the “OE–Empl” (column 6) pattern. Also with regard to our second research question on the German reunification, the SAMM procedure enabled us to identify East German women’s higher employment volatility with various back-and-forth movements as one of the most distinctive diverging trends in East and West German women’s employment after reunification.
In addition to these main findings, the period and age covariates can also be of interest. The period captures general trends in employment trajectories over time. It shows the most pronounced effects on subsequences related to education. The findings support that both the duration of education and the likelihood of reentering education after employment or periods out of employment has increased over time. This is visible in the negative period effect for the “Edu–OE” (column 2) and “Edu–Empl” (column 4) clusters as well as the positive period effects for patterns of reentry to education: the “Edu–OE–Edu–Empl” (column 3), “OE–Edu–Empl” (column 5), and “Empl–Edu–Empl” (column 8) clusters. Age is certainly important in many applications. Some ideal-typical patterns might be more likely to occur at younger or older ages. As expected, in our application, the hazard of entering subsequences that mark a return to education decreases with age: “Edu–OE–Edu–Empl” (column 3), “OE–Edu–Empl” (column 5), or “Empl–Edu–Emp” (column 8).
5.2.2. Robustness Check: Varying Subsequence Length
We ran the SAMM procedure with different subsequence lengths
5.2.3. Comparison with Standard MMs
To highlight the advantages of the proposed methodology, we compare results from the SAMM procedure with those from conventional MMs, in which competing risks are direct transitions between states (Table 3). In this case, MM does not identify an increase in medium-term transitions to OE after education in the East. In the subsequence-based model, we can distinguish between short education interruptions (which do in fact constitute a continuation of education) and medium-term transitions to OE: In the conventional MM, the two subsequences are aggregated.
Transition-based Multistate Model for Women’s Employment States in East and West Germany
Note: EDU = education; EMPL = employed; OE = out of employment; AIC = Akaike Information Criterion.
Source: NEPS data.
p < .10. *p < .05. **p < .01. ***p < .001.
Importantly, by using the conventional MM, we are not able to observe any increase in back-and-forth movement between OE and employment in the East after reunification—because, for instance, transitions to short-term or medium-term employment cannot be distinguished. Finally, we can see that the statistical significance of the “East: Reunif” coefficients are somewhat smaller in the conventional model, indicating that the distinction between subsequence types are relevant to detect changes in the degree of volatility of women’s employment trajectories.
6. Extensions of the SAMM Procedure
We discuss here some possible extensions of our proposed framework. The SAMM procedure relies on two kinds of information: a sequence of spells
The subsequences
Finally, in some applications, the influence of previous paths on subsequence of change is of central interest. In this case, we suggest adding indicators of previous paths to the MMs. For instance, we can add the time already spent in each state or dummies for the states experienced to the models.
7. Conclusions
The relationship between time-varying covariates and processes is at the base of a number of theoretical questions in the social sciences. As a combination of SA and an MM, the SAMM procedure allows us to study patterns of change—namely, subsequences following a transition—along processes. Compared with the conventional methodological approaches, the SAMM procedure offers several advantages to the analysis of the relationship between trajectories and time-varying covariates for different units of analysis. Our illustrative study of women’s employment trajectories in Germany highlights the SAMM procedure’s usefulness by identifying an increase in volatility of employment trajectories in the East after reunification, signified by more frequent medium-term transitions from education to OE, more recurrent back-and-forth dynamics between OE and employment, and more usual long-term OE spells after education while being in a union.
More generally, within the SAMM procedure, transitions, turning points, and changes are conceived as lasting over a certain period; they are not instantaneous events, as is usually assumed in EHA. Furthermore, studying patterns of change makes it possible to uncover potential interdependencies among states and transitions along the trajectories. The SAMM procedure also considers the duration of a spell—an important dimension of life trajectories and processes. More importantly, within this framework, we can estimate the effects of time-varying covariates on the identified patterns of change. This is crucial for our illustrative example so that we can not only study how individual trajectories are linked to macrostructural changes but also address the issue of linked lives or how various life domains are entwined (Elder 1974). Finally, unlike the typical SA, the SAMM procedure can handle censored observations: In our application, this characteristic makes it possible to include younger cohorts whose trajectories are only partially observed.
In this article, we presented an illustrative application of the SAMM procedure within the field of life-course research, but the general framework can be extended to a wide range of fields and disciplines where there is a theoretical interest in studying the complex relationship between time-varying factors and processes that are coded as a sequence of categorical states.
Footnotes
Acknowledgements
We warmly thank the anonymous reviewers for very helpful comments. We also greatly appreciate the suggestions made by participants in the USP Writing Workshop at WZB Berlin on an earlier version of the manuscript.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This publication benefited from the financial support of the Swiss National Centre of Competence in Research LIVES—Overcoming Vulnerability: Life Course Perspectives (NCCR LIVES), which is financed by the Swiss National Science Foundation (Grant Number 51NF40-160590).
Notes
Author Biographies
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
