Abstract

Looking Back to Look Forward
This is the fourth (and last) volume of Sociological Methodology (SM) at Pennsylvania State University under the editorship of Duane F. Alwin and the second for Ashton M. Verdery as Deputy Editor. 1 Our editorial team, along with an additional cast of characters to be implicated in the following, has worked hard to maintain the high standards held by SM. As a testament to these claims, we should first note that we have been blessed with a dedicated and highly qualified editorial board and a large number of outstanding extramural reviewers—as listed in the preceding pages. With their help, we believe SM is successfully tackling its mission: Subscriptions have increased, submissions are healthy, and the trend is upward. Significantly, the impact or influence of SM has been rising over the past decades. We are pleased to be able to pass along a healthy journal to our capable successors, co-editors David Melamed and Michael Vuolo of Ohio State University, who assume full editorial responsibilities for the next several volumes.
In this prologue, we take advantage of the opportunity to provide a brief accounting of the performance of SM over the past several years, using available data contributed by the ASA, assessed over the 12-year window, from 2007 to the present. We also present information from Journal Citation Reports for journal impact (two-year and five-year impact scores and Clarivate article influence scores). In part as a consideration of trends in the topics of past SM articles, we also reflect on the past by briefly discussing “SM’s Greatest Hits,” as measured by article citation counts. Finally, we provide our usual overview of the contents of this volume, emphasizing the contributions of the articles to sociological methodology, in addition to a section on acknowledgments.
In passing the editorship of SM, we focus briefly not only on the current state of SM but also on its trajectory over recent years on several indicators: subscriptions, publications, submissions, acceptance rates, and article impact scores (see Table 1). 2
Compilation of Information on the Trajectories of Sociological Methodology
Note: Year = copyright year; Subs = number of subscriptions (from ASA); Pubs = number of articles published (from ASA); Ms = number of papers submitted (from ASA); Rate = acceptance rate (from ASA—final decisions only); 2-year impact = 2-year impact score (from Journal Citation Report, Clarivate Analytics, n.d.); 5-year impact – 5-year impact score (from Journal Citation Report, Clarivate Analytics, n.d.); Article influence = article influence score (from Journal Citation Report, Clarivate Analytics, n.d.).
Year Sociological Methodology transitioned to SAGE Publishing.
Recalculated values for 2-year and 5-year impact scores, because Clarivate Analytics (n.d.) counted 12 articles for 2016 (there were only 10). Also, they counted 8 publications in 2015 when there were 9. We cannot recalculate the article influence score for 2018 given the incorrect numbers used in the denominators of the 2-year and 5-year scores.
It is difficult to measure trends in subscriptions from the available data because of historical differences in reporting these numbers, and we do not include the number of institutions in developing countries that receive free access from SAGE (currently about 5,200). Since 2012, when SM transitioned to SAGE, SM subscriptions have increased. These numbers primarily reflect the growth of institutional subscriptions. Member subscriptions have declined over this period, presumably due in part to the fact that since 2013, ASA membership includes online access to all ASA journals. Although these numbers may not reflect readership per se, they are an indication of the journal’s growing visibility.
As shown in Table 1, the number of articles published in SM each year ranges from 7 to 11, with an average around 10 (commentaries and rejoinders are not included in these numbers). Currently, to the extent possible, we limit SM publications to approximately 6,000 to 8,000 words of primary text (i.e., excluding references, tables, figures, and other visual content), although we maintain a degree of flexibility. Given only 400 published pages to work with, it is not always possible to publish appendices. This has been accommodated by allowing authors to publish lengthy appendices as supplemental online files.
The number of manuscripts submitted for review has also increased over time, although these numbers are quite variable. In the most recent period, 2014 to the present, the number ranges from about 50 to 70. Similarly, acceptance rates depend intimately on what enters into the calculation (ASA has changed its definitions over time), but using comparable numbers, the acceptance rate for SM averages roughly 12 to 13 percent in recent years. This may not convey much about SM given our published manuscripts typically undergo at least three rounds of submission (i.e., receiving two “revise and resubmit” decisions) prior to acceptance.
There is ample evidence SM has demonstrated a positive trajectory on a number of dimensions, and there is no dearth of information on the fact SM has published and is committed to the publication of a diverse set of methodological innovations. Clear efforts have been taken to include greater breadth in coverage of innovations across the range of research methods, including a wide variety of qualitative methods popular in the discipline. This is not always easy to achieve given SM is a submissions-based periodical and an annual publication without much flexibility to devote “special issues” to selected topics. However, past and present editors are committed to increasing the number of submissions from all areas of methodological innovation in the discipline.
Critical to these potentialities is the inclusion of symposia that provide a rare opportunity for comment, critique, and exchange on important methodological topics. Without going into detail about the positive impacts these special symposia have on the field of sociological methodology, we believe this mechanism permits the editor(s) to initiate conversations on important trends in methodology. The development of such symposia can be accomplished without taking away from the normal submission-based publication needs of the journal while at the same time giving voice to diverse views on relevant issues that are important to sociological research constituencies.
Impact of SM
Impact measures reported in Table 1 are based on citation counts and impact reports published in Journal Citation Reports (JCR), measures that have been used by Science Citation Index for decades. 3 The measures for comparing journals with respect to impact rely primarily on the frequency with which articles in a given journal are cited relative to the number of published articles in the journal. Specifically, the Thomson Reuters “impact factor” measures the frequency with which the “average article” in a journal has been cited in a particular time period. It is calculated as a ratio, namely, “the ratio between citations and recent citable items published” (see http://wokinfo.com/essays/impact-factor/). For example, the standard (two-year) impact factor for a given journal, say SM, in 2018 is calculated as the ratio B/C, where B = total number of citations contained in all journal articles during 2018 to SM articles published during 2017 and 2016 and C = number of articles published by SM during 2017 and 2016. Note that the journals containing articles citing SM (in the numerator) are not just sociology journals. 4
This (two-year) impact score is not a precise measure of SM article impact because SM’s annual issue has, for many years, been published in August of each year. 5 The five-year impact score is perhaps a more valid indicator of SM’s impact since sociological methodology tends to be absorbed by the social sciences relatively slowly. To calculate the five-year 2018 impact score for SM, one would form the ratio A/B, where A = total number of citations contained in all journals during 2018 to SM articles published 2013 to 2017 and B = number of articles published by SM during 2013 to 2017. The justification for using the five-year measure is that in fields with relatively long publication lags, few articles will cite papers published in the two previous years. So although the two-year impact factor might work well for physics and chemistry, it works poorly for sociology. Moreover, there is a lot of variation in the two-year impact factor for sociology journals, especially small ones. In the past, this may have led to a “big jump” in the impact factor for a journal being touted as evidence that the journal has become one of the “top journals in sociology.”
In addition to these measures of journal impact, we also use an “article influence score” (see Table 1). This index is provided by Eigenfactor.org and measures the average influence, per article, of the articles in a journal over the first five years after publication. The Eigenfactor score calculation is based on the number of times articles published in the past five years have been cited in the JCR year, but it also considers which journals contributed these citations, so citations from highly cited journals will affect the scores more than those from lesser cited journals. It essentially weights citations by the impact scores of the journals citing them. The Eigenfactor score is normed so the mean article in the entire Thomson JCR database has an article influence of 1.0. A score greater than 1.0 indicates that each article in the journal has above-average influence. A score less than 1.0 indicates that each article in the journal has below-average influence. It is comparable to the widely used Thomson Scientific five-year impact factor mentioned previously. References from one article in a journal to another article from the same journal are removed, so Eigenfactor scores are not influenced by journal self-citation.
The impact scores displayed in Table 1 indicate, regardless of the particular indicator, the impact/influence of SM has generally increased over the period considered. We also examined article influence scores for three additional sociology journals, Sociological Methods and Research (SMR), the American Sociological Review (ASR), and the American Journal of Sociology (AJS). The latter two are broadly considered the top (open submission) substantive journals in U.S. sociology. These results (not presented here) show trends from 2007 to 2018 for these four journals (see Alwin and Winship 2018). The impact of the two major substantive journals—ASR and AJS—improved slightly during this period, moving from about 4.0 in 2007 to 4.5 in 2016, but what is most remarkable are the gains in the article influence scores of the two methodology journals. The impact data from 2007 to 2016, the most recent years for which journal impact data are available, suggest the impact scores of both SM and SMR have improved considerably, taken individually or in comparison to ASR and AJS. SM and SMR have article influence scores around 2.0 through 2012. SM influence scores began to increase to around 3.0 by 2015 and 2016. SMR also improved over this period, and since 2016, SMR exceeded SM in its influence score. Of course, it is difficult to completely understand the true nature of this change, but using the “inter-ocular trauma test,” the results appear “significant,” or noteworthy at the least. We conclude that sociological methodology as a field shows remarkable strength. 6
SM’s Greatest Hits
Generally, SM does not publish substantive research but focuses on methodology (i.e., research on methods), especially methodological innovations. Historically, SM tends not to publish review articles, applications of research methods, or demonstrations of methodological tools. It does, however, publish commentaries, debates, and symposia on recent methodological developments across the broad spectrum of sociological methodology. The journal thus endeavors to restrict its publications to contributions that provide breakthroughs or newly developed research strategies, which will ultimately improve sociological research. It is sometimes said that SM does not publish enough research on certain topics or enough research that uses certain types of methodologies, but this perception rests, we would argue, on a fundamental misunderstanding of the mission and accomplishments of SM.
We do not have data on the performance of SM beyond the past 12 or so years, as shown in Table 1, but one way to evaluate SM’s influence on the field is to look at citation counts of SM articles over time. Citation counts measure the impact of the articles published as well as trends in the field of sociological methodology. There may be other ways to register the impact of published papers, but it is now becoming standard practice to evaluate papers and entire journals using citation counts. Most articles published in SM are cited fewer than 25 times, many are cited less than 10 times, and some not at all. 7 However, several have received high citation counts, which we refer to as “greatest hits,” as defined by the number of citations recorded by Web of Science and Google Scholar. 8 We examined data for all SM articles published in volumes 1 through 45, as of April 2019 (full data set available on request), to find our greatest hits (see supplemental online appendix).
It is safe to say that citations have grown exponentially over the past 50 years. The number of scientific journals and published articles has grown, the number of scientists has increased, and the potential to publish and be cited by others has increased. From this point of view, more recent publications may be more likely to be cited, but this is offset by longevity. Naturally, the longer an article has been around, the more likely it is to be cited, although that depends more on its popularity than it does on time per se. To control for time, we compared articles within what we call a particular “era” of sociological methodology, and by doing so, we were better able to evaluate the most influential SM articles.
Historically, SM has focused on innovations in sampling, measurement, text analysis, comparative analysis, quantitative modeling and model selection, statistical inference, social network analysis, and a range of other topics. Some topics have been especially popular at different times. In SM’s formative years (1969–1984), the articles with the most impact had to do with path analysis and structural equation modeling (see Heise 1968; Land 1968). Mike Sobel’s (1982) classic article on confidence intervals for indirect effects in structural equation models is the most highly cited SM article of all time—13,690 Google cites as of this moment! In addition, see Sobel (1986), also a greatest hit, which has been cited 1,000 times.
With its rise to maturity in the latter part of the twentieth century, SM continued to be the site for publication of innovations in event history measurement and modeling (e.g., Allison 1982; Petersen 1995) as well as in structural equation models (e.g., Muthén and Satorra 1995; Wheaton et al. 1977). During this period, the influences of Bayesian approaches became more prominent, and one of the most highly cited articles published in SM was Adrian Raftery’s (1995) famous Bayesian information criterion (BIC) article.
SM was also the site of some early papers on methods for social network analysis (e.g., Tom Snijders’s [2001] important work on social network dynamics), an area that has continued to be an important focus of the journal. In the most recent period, SM network articles have continued to be highly visible, and several new areas have seen substantial attention, as seen in citation counts. SM has been especially instrumental in publishing influential papers on respondent-driven sampling, the issues surrounding it, and related approaches (see Salganik and Heckathorn [2004] and the work that followed).
The topic of incomplete or “missing” data has been of interest across the past several years (von Hippel 2007; Yuan and Bentler 2000). And measurement issues have been a constant focus of highly cited SM articles across all time periods, including measurement of occupational prestige and socioeconomic indexes of occupations (Hauser and Warren 1997; Nakao and Treas 1994), measurement of spatial segregation (Reardon and Firebaugh 2002), life history and life course measurement (Freedman et al. 1988), and the systematic observation of neighborhoods, otherwise known as “ecometrics” (Raudenbush and Sampson 1999).
Finally, to examine the impact of SM articles in the future, a critical indicator of impact or influence would be the number of downloads. Especially for the most recent articles published, downloads are possibly a superior indicator of influence. To satisfy our curiosity, we performed a preliminary analysis of the top downloads over the past dozen years (2007–2016). Our measure captures the number of times SAGE reports the article being downloaded, starting in December 2016 (thus covering, roughly, a two-and-a-half-year period). Of course, truncation of the measurement period may introduce bias and lead to an inaccurate picture of the “true” influence or popularity; we see some evidence this is occurring, with a .32 correlation between citations and year and a linear relationship implying 45.7 additional downloads per year. It is interesting to explore which types of articles are garnering the most attention, but we do not examine these issues here, except to note that many of the top 25 most downloaded articles deal with qualitative methods or textual analysis.
Quantitative and Qualitative Methods
SM publishes across the range of methodological strategies, from various aspects of qualitative methodology to quantitative methodology, although its publications are more variegated than is conveyed by this gross categorization. This is in many ways a false dichotomy given orthodox or traditional qualitative and quantitative strategies are increasingly combined or even side-stepped by new and fundamentally different approaches (e.g., computational analysis of text or interview data using topic modeling procedures) so that any given contribution may include a variety of components and thus may not be amenable to such classification. Indeed, it is our sense that several of the most prominent areas of methodological innovation in the discipline are occurring at the intersection of what were traditionally thought of as qualitative and quantitative data (e.g., text analysis, retrospective life history calendars, and diary methods).
As already mentioned, except for commentaries included in symposia, SM is an open-submission journal, that is, there is no solicitation of papers. SM publishes the best work it receives, regardless of area but in all instances consistent with the journal’s focus. Symposia are built around submitted papers. The overall focus is on methodological innovations, and innovation is not always communicated with the same rate or frequency across all methods. There has been some complaint that SM does not publish articles on qualitative methods. This is perhaps because of a lack of familiarity with the content of SM. This may also be due to a misunderstanding of SM’s mission on the part of sociological researchers, who may not submit innovations in qualitative methodology because they assume sociological methodology is only about statistical or quantitative approaches. Or, perhaps the methodological innovations in qualitative research are more often passed down through word-of-mouth connections and in graduate training rather than through formal, peer-reviewed publication. Our own subjective assessment is that SM has published a remarkable balance between qualitative and quantitative methods over the years. The list of articles published in the past 12 years of SM reveals some insight into recent trends, especially with regard to observations that are sometimes made. SM publishes a great deal of content dealing with innovations in qualitative methods. Symposia dealing with qualitative methodologies include the symposium on qualitative data analysis (2012), the symposium on qualitative comparative analysis (2014), and the symposium on interviewing, which included an article on qualitative interviewing (2016). The inclusion of qualitative emphases in the present symposium (see the following) represents a fourth devoted to qualitative methods in recent years. Throughout the past 12-year window, articles have covered topics such as qualitative interviewing, semistructured interviewing, ethnography, text analysis, nonprobability respondent-driven sampling methods, approaches to content coding, qualitative data analysis, and qualitative comparative analysis. Finally, all past and present editors have had a commitment to publishing the most recent innovations in qualitative sociological methodology, and this commitment is demonstrated by examining the list of content published in SM.
This Volume
The present volume of SM includes articles dealing with a wide range of methodological topics, including measurement problems in the study of social movements, deep learning approaches to the analysis of image and text data, natural language processing, statistical model comparisons, multilevel modeling issues, and models for diffusion in social networks, along with notes and comments on previous SM publications.
Symposium on Big Data—Innovations in Computational Social Science
With the development of the Internet and the worldwide web, our lives have become inundated with vast amounts of information—we live in a “big information” or “big data” era. Take sports, where all sorts of new information is available on player performance (see e.g., Epstein 2013). Michael Lewis’s (2003) popular book Money Ball (as well as the movie), for example, highlights the work of SABR-metrician Bill James (SABR = Society for American Baseball Research) in the development of new “metrics” to evaluate player success in major league baseball (see Albert 2010). This occurred in other sports as well.
More and more information is available to be analyzed, which places demands on innovative analytic techniques. In science, developments can be seen in the availability of huge databases. In genomics, space exploration, marine biology, and nearly every science, researchers are confronted with big data (Pearl 2018). Even closer to home, in the social sciences, we have become inundated with data. U.S. federal census data are increasingly available in one form or another, including images of the original census enumerations (see e.g., Ancestry.com). Maintaining the traditional standards for collection of population-based big data, the Census Bureau and Bureau of Labor Statistics collaborate to produce a number of public data sets on the U.S. population, including the quarterly CPS (Current Population Surveys). Another example is the American Community Survey (ACS), an ongoing survey of the Census Bureau, which includes more than 3.5 million respondents per year (U.S. Census Bureau 2015). Several cohort studies of labor force participants are increasingly available (see Bureau of Labor Statistics 2019). And with the continuation of many existing sources of social monitoring data (e.g., the U.S. General Social Survey, the German ALLBUS, the British Social Attitudes Study, and the World Values Survey, to name just a few) as well as new large-scale multinational data projects, such as the European Social Survey or the International Social Survey Program (ISSP), along with the digitization of many archival records, new demands are being placed on how we approach data—social science data are bigger than ever before. The bottom line here is that “big data social science” has arrived, and it will put major demands not only on how we think about testing hypotheses about society but also on how we develop curricula for training students, how we use computers, and how we communicate with one another. Ultimately, the hope is that big data will help us answer big questions.
One tantalizing example of a massive new big data resource is Google Ngrams (see Michel et al. 2011), an ambitious collaboration between Google and many of the world’s largest libraries and book publishers, which intends to digitize every existing book ever printed. The corpus currently contains 361 billion words dating back five centuries to the 1500s. The more than 15 million scanned and digitized books, in several languages, represent approximately 12 percent of books ever published. The process of developing the Ngram database is straightforward. Each book is scanned, digitized via optical character recognition, and the extracted words are added to the Ngram database along with key metadata. The Ngram database contains all words appearing at least 40 times in the entire corpus; it includes the year in which the word appeared in a published text, overall frequency of occurrence, number of pages on which the word appeared, and number of unique books containing the word. To date, Google has supplied searchable American English databases (one-gram, two-gram, three-gram, four-gram, and five-gram databases). The one-gram database contains single words (e.g., fish), the two-gram database contains combinations of up to two words that were adjacent to each other in the scanned books (e.g., catching fish), the three-gram database contains combinations of three adjacent words (e.g., likes catching fish), and so on. The potential of Google Ngrams for the study of society is great, and some scholars have used the term culturomics to refer to the use of these new digitized resources in the study of trends, regional differences, and the human cultural genome (Michel et al. 2011). 9 In the area of methodology, one could, for example, examine historical trends in the use of methodological or scientific terms like survey, sample, experiment, statistic, empiricism, validity, causation, or placebo (see Alwin 2013).
New approaches that merge computer science, visualization tools, and traditional statistics are being developed to accommodate these growing resources, and in keeping with these observations, this year’s symposium highlights the exploration of nontraditional data resources, including text and image data from social media, and the computational analysis of textual meaning using natural language processing. The term big data has come to be associated with many of the new large data sets; the authors of the two articles included here, along with the commentaries and replies, focus on examples of how big data resources can be used to promote new understandings in traditional research areas. In addition to considering big data issues, these two papers also embody a modern approach to data analysis involving “computational social science” (e.g., Evans and Aceves 2016; Nardulli, Althaus, and Hayes 2015; Nelson 2017). The existence of large amounts of text have created a “demand for natural language processing and machine learning tools to filter, search and translate text into valuable data” (Evans and Aceves 2016:21).
The symposium focuses on potentially new resources for sociology, including automated analysis of image and text data from social media and data mining of newspaper text using natural language processing. As these articles reveal, machine learning and data mining approaches are at the cutting edge of methodological developments in sociology and social science more generally, and they are squarely in the intersection of what is traditionally thought of as a divide between quantitative and qualitative data. “CASM: A Deep-Learning Approach for Identifying Collective Action Events with Text and Image Data from Social Media,” by Han Zhang and Jennifer Pan (hereafter ZP), goes beyond just text. They analyze collective action events in China using both image and text data. The study of social movements in general and the study of protest events in particular traditionally rely on news media reports as a source of data (see e.g., Martin, Rafail, and McCarthy 2017). The authors introduce a new approach—CASM (collective action from social media)—tailored to the study of collective action and protest events that happen in authoritarian regimes and that are not reported in traditional media. CASM uses text and images from social media data and deep learning algorithms to identify offline collective action events in real time. This approach harnesses digital technologies and deep learning algorithms to reveal posts that likely discuss offline collective action events. Deep learning (or deep structured learning or hierarchical learning) is a branch of supervised machine learning that uses a framework of artificial neural networks to create algorithms that are “trained” to identify collection action events.
Deep learning algorithms are just beginning to be used in social science research, in particular in research using image data. ZP's innovative use of these methods expands on this emerging strand of social science research by using deep learning for image and textual classification. This article should also be read against the backdrop of the fact that the Chinese government places restrictions on traditional and new media to limit subversion of its authority. The article implements CASM for China using social media data from Sina Weibo, the largest social media platform in China, with over 445 million monthly active users. ZP describe the advantages and disadvantages of using social media as a data source for identifying collective action events, and they explore the external validity of this target data source with other event data sets. The article also evaluates the impact of censorship on the quality of the CASM-China data, and it considers how computer science methods can be made more practical and usable in the social sciences. In summary, deep learning algorithms have helped make significant advances in many machine learning tasks, especially tasks related to analysis of images and text, such as image classification, multiple object detection automated image captioning, voice recognition, and parts-of-speech tagging.
The commentaries on the ZP article, by Swen Hutter, Pamela Oliver, and Zachary C. Steinert-Threlkeld, offer a great deal of praise for these new approaches to protest-event analysis but not without some cautionary admonitions. The commentaries agree that ZP have given the sociological community three “gifts” (in Hutter’s words): (1) a two-stage classifier of text and images, (2) a large data set on offline protests reported online during a specific historical period in China, and (3) an accessible text that explains their methods. On the critical side, Hutter focuses on the conceptual issues of the coding unit, the identification of duplicates, and relations between online and offline dynamics. Oliver similarly lauds ZP’s contributions. She observes the potential for human error and discusses the various advantages and disadvantages to strategies of compiling protest event data. Steinert-Threlkeld suggests ZP’s work represents “the new frontier” in the study of collective action. He highlights their contributions in including image data in addition to text and holds that the debate about the quality (bias) of various types of event protest data (multimodal event data) reflects a promising new approach to the problems of bias. The collective conclusion is that these new methods will assist in the development of automated systems for coding protest events that permit greater accuracy and analytic power. The commentaries agree that ZP’s work is already helping shape future innovations in protest event research, and in the words of Steinert-Threlkeld, “the future of event data is bright, and one of the brightest areas is using social media images” to develop new event protest databases.
The second article in the symposium, “Analyzing Meaning in Big Data: Performing a Map Analysis Using Grammatical Parsing and Topic Modeling” by Jan Goldenstein and Philipp Poschmann (hereafter GP), offers another example of the application of analysis of big data using computational methods that perform text analysis through natural language processing. They perform a “map analysis” of textual meaning in which they use a “layered” approach combining “grammatical parsing” to discover communication structures and “topic modeling” to assess semantic context. We consider the combination of these two approaches to text analysis to be one of the unique contributions of their article. They creatively apply this layered approach to examine how the meaning of the concept of “corporate responsibility” has changed in the United States.
These methodological tools go by a number of terms. Text mining is the most commonly applied term, but this does not necessarily convey the variety of approaches. The underlying technique that enables contemporary approaches to content analysis involves natural language processing (or NLP), which is a field of specialization that focuses on interactions between human language and computational linguistics and artificial intelligence. The authors combine grammatical parsing using NLP tools and topic modeling in what they refer to as a “map analysis” of different layers of meaning. In machine learning and natural language processing, grammatical parsing refers to methods of breaking down a text into its component parts of speech. Topic modeling, on the other hand, is a text-mining tool for the discovery of latent semantic structures in a body of text. Topic modeling is a popular approach to the measurement of cultural meaning (see Mohr, Wagner-Pacifici, and Breiger 2015).
GP’s article provides an overview of the current state of formal text analysis in sociology, pointing to existing sources on the topic. In so doing, the article raises the question of whether text analysis should be free of subjective (ex ante) interpretations. GP also provide an introduction to the concept of “maps” in text analysis (to wit: a map involves analysis of textual characteristics without any subjective assumptions, and it presents a focused version of the entire text corpus from the perspective of the chosen textual characteristics). They present the simultaneous consideration of two layers of textual meaning using this map analysis approach. They first obtain a grammatical parsing that reflects communication structures, and then they follow this with a layer of topic modeling on top of the structural elements.
This article introduces an approach that suggests a combination of text-mining tools that address different layers of meaning. In their empirical application, GP analyze 15,371 newspaper articles published in the New York Times and Washington Post from 1950 to 2013 on the topic of corporate responsibility. They used state-of-the-art Stanford CoreNLP software (Manning et al. 2014) to enable their supervised learning approach to perform grammatical parsing. They then coupled this with their own topic modeling software that implements content analysis of the semantic patterns. This article provides a fully developed NLP software application implementing grammatical parsing and topic modeling and enables the large-scale analysis of communication structures (i.e., semantic triplets) and hidden semantic patterns in texts.
Commentaries by expert text analysts, Burt L. Monroe (a political scientist) and Laura K. Nelson (a sociologist), provide additional perspectives on the GP article. Both Nelson and Monroe raise concerns about the transparency of Goldenstein and Poschmann’s research with respect to their use of current text-mining tools, but they clearly see some value in the approach. Nelson focuses on transparency in the context of research and the analytical steps in a text-analysis project to ensure the reproducibility of results. While maintaining that GP’s research “exemplifies how to expand our text analysis tool-kit by applying computation tools to address an important sociological question,” she takes serious issue with the assumptions on which the approach rests, namely, that computational methods can supplant subjective approaches. Monroe focuses on transparency regarding data inspection and the credibility of results. He suggests the authors may have “fallen prey” to some pernicious measurement traps in the text analytic techniques they apply, and he raises concern that the authors may not have “captured the meaning they assert.” Although the invited commentaries are critical of the approach taken by GP, we conclude that the GP article usefully illustrates how recently introduced methodological frameworks for the study of cultural content can be used in sociology. They show that various tools can be used in combination with multiple layers of text analysis approaches. Ultimately, GP argue for a combination of text-mining approaches with scaling inductively grounded in a “close reading” of content. Any given study has certain limitations, and the area of content analysis (i.e., text mining using computational approaches) has a great deal more disagreement than agreement on the optimal approach. Whatever its limitations, this article illustrates the variety of applications of text mining using various forms of natural language processing, and as a representative of this new genre of research, it portends promising applications of text mining in sociology in the future.
Framework for Model Comparisons
The article in this section by Trenton D. Mize, Long Doan, and J. Scott Long, “A General Framework for Comparing Predictions and Marginal Effects across Models,” tackles the problem of comparing predictions or effects across models, an issue that is receiving substantial attention in the discipline writ large (e.g., Mood 2010; Mustillo, Lizardo, and McVeigh 2018). Statistically oriented social scientists have been pursuing the question of how best to compare across models for at least a half century (e.g., Chow 1960), particularly within nonlinear models and other complex scenarios. Sociological methodologists have long led advances on these sorts of topics (Sobel 1986). In this new advance to the literature, the authors argue for a general and flexible approach to model comparisons that permits comparison of marginal effects across studies or across groups within a study. The framework they propose subsumes a wide range of cross-model comparisons. They begin with a clear statement of the problem, review the importance of assessing marginal effects, and show that these issues often plague researchers trying to draw comparisons across models, studies, and data sets. They propose the use of seemingly unrelated estimation (SUEST) to combine estimates from multiple models, testing the equality of predictions and effects across models. A focus on changes in the predictions—the very thing current sociological research is increasingly attuned to—dovetails with the primary concerns of recent critics of common practice. The article applies this framework to an array of problems for binary, continuous, count, ordinal, and nominal dependent variables.
The seemingly unrelated regression (SUR) model, or seemingly unrelated regression equations (SURE) model, is an approach borrowed from econometrics (Zellner 1962) wherein multiple equations (multiple dependent variables) with potentially different sets of exogenous explanatory variables are analyzed (see Felmlee and Hargens [1988] for an introduction to the approach in sociology). The term seemingly unrelated refers to the fact that each equation can be considered on its own and estimated separately (Hargens 1988). In fact, one might say they are “seemingly related” because the disturbances in these equations are in general correlated. The SUR model is a generalization of a regression model in which there are multiple equations for the outcomes of interest. Mize and colleagues utilize this model to develop tests of cross-model differences, and to demonstrate the utility of the approach, they present examples and applications that illustrate how their framework can be used in a variety of concrete situations. Importantly, their proposed approach offers direct tests of between-model differences in the coefficients of interest, built from a principled statistical framework that can apply across a diverse array of situations of interest in contemporary social research. Their approach promises to provide a framework that can improve our methods of comparing effects across models, and it seems to provide an answer to many hotly debated methodological issues in the discipline. We expect readers will share in the excitement about its applicability to many research questions.
Multilevel Models
The data collection explosion of the past half century has made it easier for researchers to examine units that are nested, or otherwise embedded, within some other type of unit. Examples abound: multiple respondents surveyed in different years, respondents surveyed in different years in different countries, students observed in different grades in a school or in different schools, and so on. Such data permit the analysis of interesting sociological questions about how social contexts influence individuals. Accordingly, given the volume of data available to contemporary sociologists and the theoretical importance of questions about the ways context affects individual lives, a robust methodological literature has evolved for working with such data. This literature is voluminous, but there remain important areas for further development. Volume 49 of Sociological Methodology features two articles on topics related to hierarchical or multilevel modeling that advance different aspects of these approaches, both geared to practitioners and both recognizing problems in the standard toolkit of approaches to such data.
The article “Getting the Within Estimator of Cross-Level Interactions in Multilevel Models with Pooled Cross-Sections: Why Country Dummies (Sometimes) Do Not Do the Job,” by Marco Giesselmann and Alexander W. Schmidt-Catran, examines the limits of a common approach to studying hierarchically organized data. When analyzing cases where survey respondents are nested within country-years, which are in turn, of course, nested within countries, it is common for researchers to estimate within-country effects that seek to account for unobserved heterogeneity. Past research on this topic has demonstrated the desirable properties of such an approach for understanding country-level characteristics, but this work has devoted insufficient attention to the performance of cross-level interactions in these models. Using a combination of algebraic logic, Monte Carlo simulation experiments, and other arguments, the authors demonstrate that unobserved correlated moderators can be a substantial source of bias. The authors guide readers through the limitations of the standard approaches and discuss a new specification of the country fixed-effects estimator that allows researchers to obtain unbiased within-country estimates of such cross-level interactions when there is unobserved effect heterogeneity between countries. In addition, the article offers some thoughts on the decomposition of effects into within and between components in a random-effects framework. To highlight the importance of these results for practitioners, the authors reevaluate a previously published study that used a country fixed-effects approach to study the relationship between labor market policies and subjective well-being among people who are unemployed. Given the growing availability of international research data and interest in questions about national differences (in policy, economy, and social situations), this article will be of broad interest.
The second article that deals with issues pertaining to hierarchical or multilevel modeling, “Assessing Differences between Nested and Cross-Classified Hierarchical Models,” by David Melamed and Michael Vuolo, gives a detailed overview of the distinctions between cross-classified and fully nested models. Data are cross-classified when units at one level appear in multiple units at another level; for instance, when students are measured in two separate schools, each student’s record exists in two schools. In such cases, the data are not fully nested; that is, there is no way to organize the data such that each higher level unit contains all observations of each lower level unit, as would be the case if, for instance, no student ever switched schools but each student might appear multiple times (e.g., at different grades) in a given school. Cross-classified data are challenging to model in a computationally efficient manner. Prior work has proposed a number of shortcuts to make these problems analytically tractable, but a broad understanding of the tradeoffs these shortcuts entail is lacking. This article explicitly examines what such shortcuts mean for parameter estimates. Using a Monte Carlo framework, the authors conduct a series of simulation studies. They find that when outcomes are continuous, there are substantial differences between different shortcut approaches in the standard errors and the explained variances, indicating the likelihood of challenges for hypothesis testing; for binary outcomes, there are similar but less consequential issues. The authors conclude with a call for humility: When data are cross-classified but researchers model them as fully nested, it is imperative to explicitly discuss the limitations associated with this approach. Whenever feasible, cross-classified models should be used. However, this is not always possible, which means future scholarship should prioritize work on developing more computationally efficient approaches to handling such data.
Models for Diffusion in Social Networks
That actors are embedded in networks of social relations is a fundamental sociological insight that has generated a vast literature. In this line of research, the topic of diffusion—the spread of ideas, information, resources, behaviors, diseases, and other items through network ties—features increasingly prominently. Network models of diffusion have witnessed an explosion of research by sociologists and others in recent decades (Bearman, Moody, and Stovel 2004; Centola 2010; Centola and Macy 2007; Christakis and Fowler 2007; Goldberg and Stein 2018), building on classic insights about such processes (Coleman, Katz, and Menzel 1957; Granovetter 1973; Rogers 1962). Despite several methodological advances in the study of diffusion, two topics remain particularly underexplored: measurement error, specifically the uncertain accuracy of measured ties between actors that are developed from survey responses or other sources, and the dynamic co-evolution problem, whereby both actors’ attributes and network ties change over time in ways that impede the ability to distinguish cases where actors’ attributes change in response to diffusion from ties to peers with different attributes from cases where actors change their ties in response to their peers’ attributes.
The article “Social Space Diffusion: Applications of a Latent Space Model to Diffusion with Uncertain Ties,” by Jacob C. Fisher, tackles the first problem in studies of social network diffusion: Errors are inherent in the measurement of social networks. Such errors might arise from numerous sources, including but not limited to caps on the number of peers respondents can nominate in the “name generator” portion of a network survey, poor recall when selecting from a list all the people to whom one is tied, or non-real connections observable in online digital trace data (e.g., when two people are nominally connected on a social media platform but each “mutes” or otherwise suppresses posts from the other, making it unlikely they will see each other’s posts despite the appearance of a connection). Even if network ties were measured accurately at one point in time, networks are dynamic and may not reflect actual ties that exist at other points. In any case, networks are never measured perfectly, but studies of diffusion rarely recognize this. Fisher’s article proposes a new approach to address these limitations. He suggests using a latent space model of social networks based on observed ties and other features of the data to create a predicted network, where each tie is assigned a probability of existing, then simulating diffusion on the predicted network, with diffusion more likely over the ties with higher probabilities than the ties with lower probabilities. Testing this proposed method on longitudinal data from a set of schools in rural Iowa and Pennsylvania (the PROSPER study), he finds that a diffusion model based on the proposed latent space approach predicts attitude changes 10 percent better than a diffusion model using the measured network alone. These results suggest there is substantial promise for studies of diffusion to incorporate a more probabilistic framework that recognizes and explicitly addresses the innate measurement error that accompanies network data.
The second article on social networks in this volume, “No Longer Discrete: Modeling the Dynamics of Social Networks and Continuous Behavior,” by Nynke M. D. Niezink, Tom A. B. Snijders, and Marijtje A. J. van Duijn, deals with the second problem in studies of network diffusion: co-evolution. The co-evolution of actor attributes and network ties has certainly received a fair amount of methodological attention (e.g., Snijders 1996), but a central limitation of these studies is that they require actor attributes to be discrete. The article by this trio offers a methodological advance that does away with this requirement. To achieve this, the authors use stochastic differential equations. The article also contains other innovations likely to be of interest to researchers studying co-evolving network dynamics, including a measure of the amount of variance explained and a clear walk-through of how to interpret parameter estimates. To demonstrate the validity of this approach and offer a road map for others to implement these procedures, the authors use data from the Teenage Friends and Lifestyle study, which surveyed a cohort of 160 12- to 13-year-old students at a secondary school in Glasgow over a two-year period in the 1990s. Specifically, they study the co-evolving dynamics in the friendship network, individuals’ self-esteem, and alcohol use. Many readers will find the appendices of special interest, where the authors give a step-by-step description of how to estimate and interpret such models. These innovations will be very useful to researchers who wish to apply these methods to their own studies.
Notes and Comments
Despite limited space, the editors see justification for the inclusion of relevant commentary on previously published SM articles to improve the level of discourse in sociological methodology. In this volume we publish two comments, one by Benjamin F. Jarvis on an article published in SM volume 42 by Elizabeth Bruch and Robert Mare, “Methodological Issues in the Analysis of Residential Preferences and Residential Mobility” (see Bruch and Mare 2012). Jarvis’s comment (prepared with input from Bruch and Mare), “Estimating Multinomial Logit Models with Samples of Alternatives,” reviews the use of what sociologists and epidemiologists call conditional logistic regression but which economists and choice modelers call multinomial logistic regression (MNL). The comment corrects the sociological record with regard to methodological developments in econometrics and provides updated guidance for sampling in MNL models. Jarvis concludes that the advice given to researchers by Bruch and Mare (2012) may lead to bias in coefficient estimates in residential mobility research and the sampling corrections they encourage are not recommended.
A second comment, by Arvid Sjölander and Yang Ning, builds on an earlier SM article by one of the authors (Sjölander 2017). The earlier article developed an important extension of the case-time-control design, a tool that has existed in the epidemiological literature for the analysis of treatment effects, controlling for measured time-varying covariates and unmeasured covariates that are constant within persons over time. Prior to Sjölander’s article, the design was restricted to two time points and a binary exposure variable. His approach allows for an arbitrary number of time points and covariates and a nonbinary exposure. The present paper, “A General and Robust Estimation Method for the Case-Time Control Design,” presents a further improvement in the estimation strategy. The authors draw attention to the restrictiveness of the estimation methods proposed by Sjölander (2017) and offer an alternative: the conditional maximum-likelihood (CML) estimation method. The new method provides consistent estimates when the previous approach did not. This novel approach relaxes to a large extent the restrictive assumptions in the estimation strategy proposed originally. Through a series of simulations and the analysis of real data, the present article shows that their new estimation approach performs well under a range of scenarios relative to Sjölander’s (2017) estimation method.
Acknowledgments
Once again, we owe a great deal of appreciation to a number of individuals, without whose efforts this volume would not have been completed. Our managing editor, Lisa Savage, provided critical support in monitoring and tracking submissions as well as corresponding with authors. Stephanie Magean, who has long been SM’s copyeditor, worked on much of the material presented here, but her tenure as copyeditor was interrupted by a serious illness and her eventual passing. Stephanie worked on SM manuscripts for 27 years, beginning with volume 23 through to the present (see “In Memoriam” page vi). Mara Grynaviski, managing editor of the ASR, has been a terrific substitute for Stephanie, and we very much appreciate her help. Sara Sarver, our production editor at SAGE Publishing, helped deal with authors at the production stage and kept us on schedule to the extent possible. Karen Gray Edwards at ASA provided indispensable guidance and support, especially in helping us deal with the copyediting transition.
We wish to acknowledge the service of our editorial board as we welcome new members to the group: Jonathan Daw, Tyrone Forman, Lesa Hoffman, Laura K. Nelson, Alyson J. Pugh, Patrick Rafail, and Aliya Saperstein. We thank those editorial board members whose terms ended in December 2018—Paul D. Allison, Ronald S. Burt, Dana Garbarski, Melissa A. Hardy, Guillermina Jasso, Burt L. Monroe, and Robert M. O’Brien—for their expert advice and assistance.
Susan Welch, dean of the College of the Liberal Arts, Pennsylvania State University, deserves recognition for allowing us to locate the editorial office of SM in University Park and providing course release and financial support through our department.
We sincerely thank Peter V. Marsden for his willingness to write the dedication to Stanley Lieberson. It is an exquisite addition to this volume.
The cover art for this volume of SM is based on a photograph taken by Edgar F. Borgatta, the founder of SM. We thank Marie Borgatta for permission to reprint this photograph. We also thank Larry Chomsky (son-in-law to Ed and Marie Borgatta) for access to more than 60 flower photographs taken by Ed Borgatta and specifically for providing high-density scans of some that were chosen for the cover art of this and the three previous volumes.
