Abstract
Connectivity analysis characterizes normal and altered brain function, for example, using the phase lag index (PLI), which is based on phase relations. However, reliability of PLI over time is limited, especially for single- or regional-link analysis. One possible cause is repeated changes of network configuration during registration. These network changes may be associated with changes of the surface potential fields, which can be characterized by microstate analysis. Microstate analysis describes repeating periods of quasistable surface potential fields lasting in the subsecond time range. This study aims to describe a novel combination of PLI with microstate analysis (microstate-segmented PLI = msPLI) and to determine its impact on the reliability of single links, regional links, and derived graph measures. msPLI was calculated in a cohort of 34 healthy controls three times over 2 years. A fully automated processing of electroencephalography was used. Resulting connectomes were compared using Pearson correlation, and test–retest reliability (TRT reliability) was assessed using the intraclass correlation coefficient. msPLI resulted in higher TRT reliability than classical PLI analysis for single or regional links, average clustering coefficient, average shortest path length, and degree diversity. Combination of microstates and phase-derived connectivity measures such as PLI improves reliability of single-link, regional-link, and graph analysis.
Introduction
O
The observation of short lasting and reiterating electric field distributions over the scalp has led to the concept of microstate analysis (Pascual-Marqui et al., 1995), which describes a relatively small number of quasistable topographies in the 30–1000 ms time range, consistently found in numerous studies (Khanna et al., 2015; Koenig, 2002). Duration and frequency of these microstates are reliable over time in single subjects (Khanna et al., 2014). Converging evidence points to microstates as representatives of different subnetworks at different scales (Britz et al., 2010; Musso et al., 2010, p. 201; Stam and van Straaten, 2012a; Van De Ville et al., 2010). It can therefore be hypothesized that connectivity analysis within the periods of each specific microstate class might group together specific network activity and thus give more reliable results than the general PLI method.
As the interpretation of connectivity results with high number of different links may be difficult, graph theoretical measures were introduced in the past years, characterizing networks by integrated measures. These measures help to detect and describe differences in networks across studies.
In this study, we calculate PLI based on 4-sec epochs specifically within segments of each microstate class (msPLI) in a cohort of 34 healthy subjects. We compare msPLI and derived graph measures to previously reported PLI results in the same cohort (Hardmeier et al., 2014), and determine its test–retest reliability (TRT reliability) over three time points within 2 years.
Materials and Methods
Subjects
Forty healthy volunteers (age: median 51.8, range 20.0–68.1, 79% female) with no history of major psychiatric or neurological disease and a normal electroencephalography (EEG) underwent three 12-min, 256-channel EEG resting-state recordings with the eyes closed (Netstation 200; EGI, Oregon, sampling rate 1000 Hz, reference electrode: Cz) on three separate occasions: at baseline (T0), 1 (T1), and 2 (T2) years later. Written informed consent was provided by all study participants. The study was approved by the local ethics committee (Ethikkommission beider Basel, Nr. 74/09).
Preprocessing
The recorded data were processed exclusively with MATLAB® using the toolbox TAPEEG [Tool for Automated Processing of EEG data (Hatz et al., 2015)].
Briefly, selection of segments was performed according to user-independent criteria: After band pass filtering to 1–70 Hz, the data were split into periods of 4 sec for analysis. Channels of every period were automatically analyzed, and segments with minimal length of 36 sec (summed length >180 sec) with lowest bad channel load and highest ratio of peak frequency amplitude to minimal amplitude (peak2min) were generated. Raw data of selected segments were filtered with a high-order, linear-phase, finite impulse response filter (MATLAB: Firls, 0.5–70 and 50 Hz notch, filter order: 4.8 × sampling rate). Channels with >70 percent bad periods were excluded and an independent component analysis for all segments was performed to exclude bad activations. Consecutively, bad channels were interpolated [spherical spline method (Perrin et al., 1989)]. Each subject's segments were stitched, using a 1-sec inverse Hanning window at intersections and reduced to 214 electrodes. For quality control, a frequency analysis was performed and three criteria for rejection were automatically applied: amplitude of peak frequency, peak2min, and band power at 4–14 Hz with thresholds 0.7, 1.9, and 0.8, respectively. EEG was included if two criteria were above the threshold and the amount of automated detected bad channels and activations was below 10% (Hatz et al., 2015).
Microstate segmentation
Microstates were segmented in the baseline EEG (T0). The global field power (GFP) was calculated as standard deviation of the data at each time point (Michel and Lehmann, 1993; Murray et al., 2008):
n = number of channels
u = voltage distribution at time point t
Only samples of local maxima of GFP were selected and a modified version of classical K-means clustering with squared correlation as distance measure was used to obtain the most representative topographies (Pascual-Marqui et al., 1995). K-means clustering was calculated for results with 2–20 clusters and the optimal number of clusters was defined using L-curve via an adaptive pruning algorithm on the Krzanovski–Lai criterion (Krzanowski and Lai, 1988), resulting in four different microstates as optimum (Fig. 1). By fitting the individual data of all three time points (T0, T1, and T2) to the four template microstates, using a temporal smoothing with window size = 12 ms (Pascual-Marqui et al., 1995), a vector was created for all three time points of every subject, competitively labeling every time point to one of the four microstate classes.

Four microstates derived from K-means clustering on baseline electroencephalography using squared correlation as distance measure. Note also that the polarity (color code) is chosen arbitrarily, as microstates are independent from polarity switches that might occur within topographies.
Functional connectivity
For measuring connectivity, the PLI was selected, as this index is known to be little affected by the effect of volume conduction compared to other classical connectivity measures such as synchronization likelihood, phase coherence, or envelope correlation (Stam and van Dijk, 2002). PLI measures were first calculated in a standard approach. Data were filtered using a Butterworth filter to four predefined frequency bands (theta: 4–8 Hz, alpha1: 8–10 Hz, alpha2: 10–13 Hz, and beta: 13–30 Hz). Phase estimation was archived using a Hilbert transformation. The phase difference distribution was obtained from a time series (t1−tk) of phase differences (Δφ) between two signals and the asymmetry of the phase difference distribution was calculated as described by Stam et al. (2007).
(where k = number of samples, Δφ = phase differences between two channels)
PLI was calculated using four epochs of 4 sec and averaging the resulting matrix into one matrix per subject and frequency band, as described previously (Hardmeier et al., 2014).
Functional connectivity within microstate-segmented PLI
In the second approach, PLI was calculated within microstate segments. To this aim, the Hilbert transformation was applied to the full-length EEG using a sliding window of 4 sec with a 50% Hanning window. For every microstate class, four stitched periods of each 4000-phase difference ( = 4 sec) were then extracted, using the time frames indicated by the microstate label vector and automated routines to detect microstates with lowest number of bad channels and avoiding periods were inverse Hanning windows were applied. The number of four epochs per microstate, subject, and frequency band was selected, as this minimal amount of epochs per microstate was available in almost all EEG, given the recording time of EEG data. Like in the standard approach, the four resulting matrices (one per frequency band) were averaged per subject, for each microstate class separately.
Graph measures
Graph measures were calculated according to the formulas in Table 1: average clustering coefficient (Cw), average shortest path length (Lw), degree diversity (Kw), and degree correlation (Rw) were selected. Cw stands for the amount of local connectivity, and Lw for the optimization of distant connectivities. The lower the Lw and the higher the Cw, the more information flow is optimized toward a small world network. Kw represents the distribution of degrees; a high value stands for a network with only a few highly connected nodes, also called hubs. Degree correlation represents a similar characteristic and is high when nodes with similar degrees are connected. Moreover, graph measures derived from a minimum spanning tree (MST) analysis were added. MST analysis allows to identify the “main roads” of a network, reduces the network to a fixed number of connections (number of nodes minus one), and simplifies statistical analysis and comparison of different networks.
MST, minimum spanning tree.
Statistics
For statistical analysis, the 214 electrodes were grouped into 22 regions across the scalp (11 regions per hemisphere), excluding electrodes in the midline, neck, and face (Hardmeier et al., 2014); for connectivities, the average was taken of all connectivities connecting two regions.
Graphs of the different microstates and frequency bands were compared using a Pearson correlation analysis (subjectwise). Cross-sectional intersubject variability was expressed as the coefficient of variation (CoV) calculated as the ratio between the standard deviation and the mean of the PLI/msPLI. For TRT reliability over the three time points (i.e., across annual EEG recordings), intraclass correlation coefficients (ICC) were calculated (Efron, 1979). A bootstrapping procedure with 10,000 permutations was performed to estimate the 95% confidence interval (95% CI) for both indices and correcting artifacts of ICC. In accordance with previous studies, reliability was categorized as “excellent” if ICC >0.75, as “good” if ICC: 0.60–0.75, as “fair” if ICC: 0.40–0.60, and as “poor” if ICC <0.40 (Deuker et al., 2009; Jin et al., 2011; Wang et al., 2011).
The Code of TAPEEG, including the combined analysis of microstates and connectivity, is freely available under “
Results
Preprocessing
The automated analysis of quality criteria rejected at least one recording in the EEG of six subjects, resulting in a remaining total of 34 subjects for further analysis.
Microstate analysis revealed an optimal clustering number of four microstates (Fig. 1), consistent with previous literature (Khanna et al., 2015; Koenig, 2002). Global explained variance for the result with four microstates was 0.67. Median durations and lower–upper quartiles of single microstates are given in Supplementary Table S1 (Supplementary Data are available online at
Connectivity analysis
Results are given in Figure 2 and Tables 2, 3.

Connectomes of single microstates averaged across all participants. Connectomes were reduced to 22 nodes. Only the 20% strongest links are plotted. Nodal degrees are shown in red.
Median values were selected, as resulting ICC values were not normally distributed; parenthesis: lower and upper quartile of bootstrapping analysis; bold: “excellent” TRT.
ICC, intraclass correlation coefficients.
Parenthesis: lower and upper quartile of bootstrapping analysis; bold: “excellent” CoV.
CoV, coefficient of variation.
For visual comparison of the different network graphs derived from averaging, the four connectomes per frequency band and microstate were transformed to binary graphs, setting only the 20% strongest links to one, averaged across subjects, and again thresholded at 20% strongest connections (Fig. 2). Pearson correlation of the graphs in the different frequency bands and microstates revealed higher correlations within each frequency band across different microstates than between the different frequency bands for the same microstate (Fig. 3). The correlations were significant for the microstates in the alpha1, alpha2, and beta band. No significant correlations between the graphs of different frequency bands were found.

Connectomes were reduced to 22 nodes and Pearson correlations were calculated subject wise. Mean correlation values are shown in the lower, mean p-values in the upper triangle. In the upper triangle, only p-values ≤0.05 are shown, in the lower triangle, all correlation values are shown regardless the significance.
After reduction of the connectivity results to 22 regions, the regional-link analysis showed “excellent” TRT reliability across the three time points T0, T1, and T2 (median ICC >0.75) for theta connectivity in microstate 4, for alpha1 and alpha2 connectivity in microstate 3, and “good” TRT reliability (median ICC >0.6) for other theta, alpha1, and alpha2 microstate connectivities (Table 2). The ICC results of the regional-link analysis without microstates were distinctly lower (Table 2 and Supplementary Fig. S1). CoV results are provided in Table 3.
Graph analysis
Results of graph measures analysis are listed in Table 4. In the theta, alpha1, and alpha2 band, “good” to “excellent” TRT reliability is seen for Cw, Lw, and Kw, but not for gamma, lambda, Rw, or results in the beta band and graph measures derived from MST analysis (Supplementary Table S2).
Parenthesis: lower and upper quartile of bootstrapping analysis; bold: “excellent” TRT.
Discussion
Microstate-segmented PLI (msPLI) revealed a significantly higher TRT reliability over a 2-year follow-up period for single-link and regional-link analysis compared to standard PLI analysis. A possible explanation is that during each microstate, EEG data have a higher stationarity than without such segmentation. Microstates are defined by stable topographies of electrical brain activity with a mean duration of around 100 ms, explained by synchronized oscillations (Koenig et al., 2005). According to Koenig, two physical models of the microstates have been proposed. In a first model, a single strong electrical source causes the corresponding microstate by volume conduction. In a second model, multiple active electrical sources are operating in a phase-locked, zero-lag mode. By applying inverse solution techniques, each microstate can geometrically be reduced to the activity of circumscribed regions deeper in the brain (Pascual-Marqui et al., 2014). These regions are too extended to represent single electrical sources, and the concept of a deeper located group of sources with phase-locked, zero-lag synchronization is more likely. As a consequence, the part of electrical activity explaining the microstate measured at a single electrode would not necessarily be generated in direct proximity of the electrode. Therefore, brain activity as measured at a single electrode is composed of signals from many simultaneously active electrical sources, among them those defining the microstate. As the activity of more superficially located electrical sources is not as equally propagated to all electrodes as the activity of deeper electrical sources, the more superficially located activity is more likely to produce signals with phase shifts between single electrodes. By calculating the PLI (Stam et al., 2007), only the consistency of these phase shifts is evaluated, excluding information of oscillations without phase shifts (nonlagged oscillations).
In this study, PLI was calculated for the four different microstates separately by segmenting the phase information according to the initial microstate segmentation. The higher TRT reliability compared to classical PLI analysis is an argument for connectivity being influenced by microstates and for the concept of subnetworks depicted in single microstates (Stam and van Straaten, 2012b): the corresponding subnetwork shows increased activity during a single microstate when compared with other subnetworks, leading to a better signal-to-noise ratio for a reliable detection of a true phase consistency and resulting in the identification of links with stronger connectivity values than in an analysis without microstates. The larger range of connectivity values may explain the higher CoV values in msPLI compared with classical PLI evaluation. TRT reliability in regional-link compared to single-link analysis is distinctly higher. This may be explained by the individual placements of the high-resolution EEG nets for every single investigation, leading to small shifts of single electrodes from one to the next recording. Thinking about a maximal shift of one electrode, we decided to average the electrodes to 22 groups. The number of 22 groups is a compromise between avoiding too much obvious volume conduction effects while still representing some functional regions, even in some more detail than in Berendse and Stam (2007) using 10 regions.
Derived graph measures from msPLI connectivity results are also showing improved reliability over time compared with the analysis without microstates. Interestingly, “good” to “excellent” reliability was found for the average clustering coefficient (Cw), average shortest path length (Lw), and degree diversity, but not for normalized Cw and Lw, degree correlation (also called assortativity), or results of the MST analysis. In comparison with single-link analysis, graph measures showed lower reliability. However, these results are not easily comparable, as for single-link analysis, we reduced the number of links using a mapping to 22 regions, whereas graph analysis was performed on the full matrices with 214 nodes. By not reducing the number of nodes, the signal-to-noise ratio and consecutively the TRT reliability may be lower. A graph analysis on data with a reduced number of links is not included, as interpretation of graph results derived from connectomes with a low number of nodes is difficult. Lower connectivity values are hardly distinguishable from noise and only averaging of a larger number of matrices in the same subject improves the signal-to-noise ratio. This fact may explain the very low reliability for MST-derived graph measures. MST analysis depends partially on reliable lower connectivity strengths, as in MST, every node has at least one connection (degree ≥1). In the present study, only four matrices per subject and microstate could be included, due to the maximal available amount of recorded and preprocessed data (12-min continuous resting-state data per subject). Recording longer periods of continuous resting-state EEG may be increasingly contaminated with instable vigilance. For this reason, msPLI analysis will be limited to an analysis with a reduced set of matrices per subject and microstate, and it will be of reduced value for analyses that significantly depend on reliable lower connectivity strengths such as MST analysis.
Some earlier studies also reported on TRT reliability of connectivity and graph measures. For eyes closed resting-state analysis, Jin and colleagues reported fair to moderate TRT reliability for different nodal centrality measures at a test–retest interval of 2 weeks using magnetoencephalography (MEG) and mutual information as connectivity measure. TRT reliability was partly higher in the eyes-open compared to the eyes-closed resting state, and much lower in the gamma band (Jin et al., 2011). Deuker et al. (2009) recorded eyes-closed resting-state MEG data at a test–retest interval of 4–6 weeks and reported an excellent ICC for Lw and “efficiency” ( = 1/Lw) and a good ICC for Kw in a broader alpha band (7.8–15.6 Hz). However, these studies are not directly comparable to the present study as EEG and MEG are picking up signals from different sources (gyri vs. sulci) (Malmivuo, 2012) and mutual information reflects other aspects of connectivity between two regions as phase-derived graph measures, including the PLI (David et al., 2004).
The present observation of significantly high correlations between the connectomes of the four different microstates in every frequency band is another argument in favor of the assumption that microstates reflect activity of different subnetworks in a global network (Stam and van Straaten, 2012b; Van De Ville et al., 2010). This assumption is in line with previous work, describing all microstates as a part of the default mode network in resting state (Pascual-Marqui et al., 2014), or, alternatively, that four distinct networks are continuously active with changing intensity (Britz et al., 2010; Musso et al., 2010). Differentiating between the two possibilities is not possible based on the results of this study.
A synthesis of both possibilities might be the concept of the brain as one large network with dedicated subnetworks for different tasks, which are simultaneously active and interdependent to a certain degree.
However, thinking about the brain as one large network with dedicated subnetworks for different tasks, it is more convincing to believe in the approach where all simultaneously active networks are at a certain degree linked and dependent. The observation of stronger ICC results in the alpha1 and alpha2 band is most probably explained by the paradigm selected for the study. EEG was recorded in resting state with eyes closed, leading to most prominent power in the alpha band. Since PLI strength is positively correlated with power results (unpublished data of our group), the increased ICC values in alpha1 and alpha2 bands are likely the result of the higher PLI values in these bands. The same relationship may also explain the reduced ICC values in the beta band, as the power in the beta band is lowest, reducing the signal-to-noise ratio for the PLI calculation. In addition, the fact of averaging only four matrices per microstate leads to difficult interpretation of results in the beta band.
Validation studies combining microstates and connectivity analysis in patient cohorts are warranted.
Conclusion
The combination of microstates and phase-derived connectivity measures such as PLI improves reliability of single-link, regional-link, and graph analysis.
Footnotes
Acknowledgments
We thank the participating subjects. The financial support of the Swiss National Science Foundation (Grant Nos.: 33CM30_140338, 33CM30_124115; 326030-128775/1), Synapsis/Parrotia Foundation, Novartis Research Foundation, Freiwillige Akademische Gesellschaft Basel, Mach-Gaensslen Foundation, and Swiss Multiple Sclerosis Society is gratefully acknowledged.
Author Disclosure Statement
No competing financial interests exist.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
