Abstract
Abstract
We investigated pronoun usage by sexual predators in their online interactions with “decoys” (i.e., adults identifying themselves as children). Computerized text analysis was conducted for 561 instant messaging conversations between contact-driven solicitation offenders (CDSOs) and decoys. CDSOs not only used significantly fewer first-person and more second-person singular pronouns than the decoys but also used more second-person pronouns than adult romantic partners discussing their courtship. The other-focused nature of CDSOs' text messages is consistent with theories of the luring process positing predators' emphasis on making their targets feel special to initiate the cycle of entrapment.
Introduction
T
Contact-driven solicitation offenders (CDSOs hereafter) are the focus of the present study. Although there has been extensive research on sexual predation, pronoun usage has not been exclusively investigated in this context. Several recent studies have investigated the linguistic patterns of CDSOs, but none of the studies have looked exclusively at pronouns.4–6 The current study is one of the first studies to investigate specific pronoun usage in this context and compare these linguistic patterns to their counterparts in nondeviant romantic interactions.
Olsen et al. offer a general model of the “luring” process child molesters use to contact and control potential victims that we will use to frame the communication strategies used by CDSOs in particular. 7 According to the theory, luring begins with a “gaining access” phase, in which perpetrators strategically place themselves in positions to encounter targets. Once strategically placed, the perpetrator will initiate a “cycle of entrapment” by cultivating a deceptive sense of trust in a selected target by using flattery, humor, and promises of attention or gifts, which then escalates to grooming the target for sexual contact.
This is done through subtle communication strategies, such as discussing prior sexual experiences or showing the target pornographic images. Finally, the perpetrator attempts to isolate the target from parents, guardians, and others in the social network and ultimately approaches the target by explicitly arranging a meeting for sexual contact.
The present study explored chat transcripts via computerized text analysis, with a focus on CDSOs' language use in interactions with decoys as they attempt to cultivate a sense of trust. Although their messages are misleading and ultimately malicious in intent, it is plausible that their ostensible relationship-building interactions with decoys exhibit linguistic patterns similar to those observed in nondeviant romantic relationships.
Partners in developing romantic relationships “leak” relational beliefs and evaluations in implicit language forms. In their articulation of the “interactional” view of interpersonal communication, Watzlawick et al. drew a distinction between the explicit, literal meaning of a discourse message and an implicit meaning reflecting relational dynamics. 8 Interpersonal scholars have since identified a variety of linguistic devices that encode these implicit meanings.
One such device received significant attention over the years. Some researchers have argued that first person plural pronouns (we, us, our) are markers of shared identity, affiliation, and interdependence. 9 Although increased use of we has been found in written descriptions of highly committed romantic relationships, 10 other research has found that spoken we usage frequency has no association with relationship satisfaction11–13 and only a marginal association with relational interdependence. 14
Findings regarding usage of first person singular pronouns (I, me, my) have also been mixed. Sillars et al. discovered couples who used fewer first-person singular pronouns tended to have higher relationship satisfaction than those who used them more frequently. 11 However, Simmons et al. found that I usage to be marginally positively associated with relationship satisfaction and me to be positively associated with negative behaviors during problem-solving discussions. 12 Use of I, these researchers argue, reflects self-disclosure and perspective taking, whereas use of me reflects feeling of passivity and victimization.
Finally, usage of second-person pronouns (you, your) by couples in problem-solving discussions has been shown to be negatively correlated with relationship satisfaction 11 and positively correlated with negative relationship behaviors,12,13 a pattern that reflects excessive other-focused attention, but in an accusatory manner. 15 Although CDSOs are not building upon established relationships the way couples are, these findings provide evidence that pronoun usage in a “romantic” setting (not that there is anything romantic about adults trying to sexually solicit minors) can reflect relational dynamics.
The romantic partners participating in the studies reviewed above are different in many important respects from CDSOs attempting to lure perceived targets into criminal sexual contact. However, both groups put considerable effort into cultivating trust and intimacy with desired others, although for deceptive and malicious purposes in the case of CDSOs. A comparison of CDSOs' pronoun usage with romantic couples engaged in relationship-building conversations may prove useful in providing insight into the underlying linguistic patterns of child sex offenders and how those patterns differ from nonoffenders.
The association in pronoun use between partners in an adult romantic relationship and CDSOs could take several forms. Although the empirical evidence has been somewhat inconsistent, CDSOs may have similar I usage to romantic couples because they want to focus more on the child than themselves, similar to how satisfied partners focus more on each other rather than themselves. On the contrary, romantic couples have an already established relationship and may be more comfortable talking about themselves, which means CDSOs may use fewer singular first-person pronouns as they do not want to shift the conversation away from the target.
A similar argument could be made for the use of second-person pronouns, where CDSOs' excessive focus on the target may manifest in higher usage of you compared with others because his goal may be to flatter the target by focusing on attractive characteristics. However, a second possibility is that the CDSO will attempt to create a sense of belonging, which will result in few second-person pronouns so as not to make the child feel isolated, which could be related to why negative relational outcomes are positively associated with second-person pronoun usage. 11
If a strategy of CDSOs is to build intimacy, then it can be accomplished by focusing on the cohesiveness of the relationship. This cohesiveness can be achieved by combining the two individuals into a single entity (e.g., you and me → we). The use of we can have an effect on the perceived closeness between individuals 16 and also relates to relational interdependence. 14 So when a CDSO is interacting with a child, the intimacy-building they engage in might be reflected in their use of plural first-person pronouns.
Unfortunately, there have only been a handful of studies done on the linguistic patterns of CDSOs, so it is unclear how their language will be compared with that of romantic couples. As a result, the current project attempts to better understand the strategies CDSOs use by looking more closely at their pronoun use and comparing it with that of romantic couples, a group that frequently engages in intimacy and relationship building. However, because the pronoun literature regarding relational intimacy is inconsistent at times, we attempt to answer the following question:
There is, however, another group of individuals who need to be considered in the current study. Decoys are adults that converse with the CDSOs by imitating adolescents, and because they are concerned with portraying themselves as authentic adolescents, they should be unconcerned with building intimacy. Furthermore, decoys should only be concerned with passively capturing CDSOs who are trying to engage in sexual relationships with children, not actively persuading individuals to meet them, else they run the risk of being accused of entrapment. The language of decoys has yet to be systematically studied. Therefore, we pose an additional question:
Materials and Methods
The dataset
The current study analyzed online chats between convicted CDSOs and decoys. A publicly available archive of 623 chats (see Fig. 1 for an excerpt of the chats) is available on

Chat transcript excerpt.
Importantly, they allow the CDSO to initiate contact with the decoy first to avoid police entrapment (the Department of Justice defines entrapment as the act of implanting a criminal act in an innocent person's mind and then inducing the act to later convict that individual of the crime), making the choice to continue the conversation completely up to the CDSO and ensure that the legal evidence against CDSOs is permissible in court.
All 622 chat transcripts on the website are between child decoys and male CDSOs who were convicted of attempting to harm a child. Extracted chats were divided into CDSO and decoy language segments using a script built in the Python programming language. The script was designed to remove nonchat text (commentary by the decoy, links, irrelevant webpage text, etc.) and to calculate the frequency of pronouns produced by each chat participant (Python syntax is available from the first author upon request). A specialized list of pronouns (Table 1) was built to account for all of the various ways individuals may choose to spell these words when engaged in text-based communication. In addition, it should be noted that some individuals produced text in a haphazard and ungrammatical form, such as accidentally typing the first letter of a word at the end of the previous word, which made it unrealistic to create a script that would capture all such mistakes accurately:
Note: This pronoun dictionary was custom built to account for the many grammatical and spelling errors in the chat transcripts. The analysis was case-insensitive.
Decoy: hey
CDSO: wats up good looking
Some even chose not to include any spaces between words (e.g., “howareyoudoing?”), posing a problem for the script when counting pronouns that consist of letters that can be found within other words (e.g., “me” in “meeting”). Due to the size of the corpus and the length of each chat, having coders manually read through the chats and separate words was not feasible.
After all the chats were processed with the script, pronoun frequencies were calculated for each participant in each chat, total word counts were calculated for CDSOs and decoys, and the number of boy and girl targets was calculated. Gender counts were possible because decoys posted this information for most of the chats, but not all.
Comparison data
To compare pronoun usage of CDSOs with that produced by romantic couples, pronoun frequencies were obtained from Kurlak. 17 In her study, romantic couples discussed (with each other) relational values either in person or using a computer. No significant differences between communication modes were found, so the two were combined. The average length of the face-to-face discussions was 410 words, while the average length of the computer discussions was 256 words. The final sample used in the current study consisted of 29 heterosexual couples, all of which have been in the relationship for at least 6 months (M = 21.7, SD = 20.2).
After making a comparison to Kurlak's data, data were obtained from Slatcher et al.
18
It was important to have an additional comparison to replicate the results of the first comparison. By using Slatcher et al.'s data, we were able to determine if our findings were because of the comparison data we obtained or because of actual linguistic differences between CDSOs and romantic couples. Their study focused on text message correspondences between romantic partners during a 10-day period. These data were obtained under more naturalistic settings (i.e., couples text messaging each other throughout their daily lives) and may be a more fitting comparison for predatory chats (the full corpus of predatory chats can be accessed by contacting the first author at
Results
Corpus descriptives
Out of the total 622 chats, 561 were in acceptable condition to run analyses (i.e., all irrelevant text was removed and chats were separated into CDSO and decoy chats). Sixty-two were excluded because the chats were in a format that was unrecognized by the script and therefore were unable to be extracted. A chat could have been unrecognizable if the decoy posted it in a format that was unique in comparison to the majority, such as writing out “Bob's screen name” rather than using the CDSO's actual screen name (which was important in determining whether the message was written by the decoy or the CDSO).
The total number of words in the corpus was 5,133,278, of which CDSOs produced 3,122,884 words, and decoys produced 1,948,439 words. The average number of words produced by a CDSO per chat was about 5,566 words, while the average for decoys was 3,881 words. Five hundred nine chats indicated the gender of the decoy, 30 of which were boys and 479 were girls, which indicate a clear female preference.
Pronoun analysis and comparisons to romantic couples
To explore how CDSOs use pronouns compared with romantic couples, the frequency of first-person singular, second-person, and first-person plural pronouns were compared to Kurlak's and Slatcher et al.'s data.17,18 First, we will discuss the comparison to Kurlak's data. A multivariate analyses of variance was conducted with CDSOs, female decoys, male decoys, romantic females, and romantic males as groups for all three pronouns, which was significant, Wilk's λ = 0.54, F(12, 2971.47) = 64.13, p < 0.001, and partial η2 = 0.18. For first-person singular pronouns, the groups differed significantly from each other, F(4, 1125) = 75.9, p < 0.001, and ηp 2 = 0.21. A post hoc Scheffe analysis was conducted to account for the difference in sample sizes (also used in the comparison to Slatcher et al.'s data).
CDSOs used significantly fewer first-person singular pronouns (M = 8.4, SD = 1.9) than romantic female partners (M = 10, SD = 1.88), female decoys (M = 9.21, SD = 1.7), and male decoys (M = 10.14, SD = 1.95). Romantic female and male partners did not differ significantly from each other, but male partners differed significantly from female and male decoys (p < 0.05).
When looking at usage of second-person pronouns, CDSOs used significantly more (M = 8.17, SD = 1.97) than romantic females (M = 6.51, SD = 1.57), romantic males (M = 6.03, SD = 1.52), female decoys (M = 6.39, SD = 1.57), and male decoys (M = 5.22, SD = 1.65), F(4, 1125) = 65.98, p < 0.001, and ηp 2 = 0.19. The other groups did not differ significantly from each other.
Finally, when comparing the use of first-person plural pronouns, CDSOs used fewer (M = 0.57, SD = 0.35) than romantic females (M = 1.3, SD = 0.66) and romantic males (M = 1.1, SD = 0.79), but more than female decoys (M = 0.43, SD = 0.28), F(4, 1125) = 66.35, p < 0.001, and ηp 2 = 0.19. CDSOs' usage did not differ from male decoys. In addition, female and male decoys used less first-person plural pronouns than romantic females and males. Overall, group type accounted for 18% of the variance in pronoun usage. See Table 2 for all Kurlak comparisons.
Note: Different superscripts indicate statistically significant differences at p < 0.05 using Scheffe's post hoc analysis. In other words, means that have the same superscript did not significantly differ.
CDSO, contact-driven solicitation offenders.
Next, females and males from Slatcher et al.'s study were compared with CDSOs and decoys. The multivariate test was significant, Wilk's λ = 0.51, F(12, 3154.03) = 75.63, p < 0.001, and partial η2 = 0.2. Results indicated that female decoys (M = 10.55, SD = 2.1) and male decoys (M = 11.33, SD = 2.56) used first-person singular significantly more than CDSOs (M = 8.41, SD = 1.9), romantic females (M = 8.15, SD = 1.60), and romantic males (M = 8.06, SD = 1.30), F(4, 1194) = 98.69, p < 0.001, and ηp 2 = 0.25. CDSOs, romantic males, and romantic females did not differ significantly.
CDSOs used you words (M = 8.17, SD = 1.97) significantly more than all other groups, while female decoys (M = 6.44, SD = 1.64) used you significantly more than romantic females (M = 4.96, SD = 1.69) and romantic males (M = 4.65, SD = 1.29), but less than CDSOs, F(4, 1194) = 127.08, p < 0.001, and ηp 2 = 0.30. Romantic females, romantic males, and male decoys did not differ significantly from one another.
Female decoys also used fewer we pronouns (M = 0.43, SD = 0.28) than CDSOs (M = 0.57, SD = 0.35), romantic females (M = 0.61, SD = 0.39), and romantic males (M = 0.58, SD = 0.44), but did not differ from male decoys (M = 0.5, SD = 0.30), F(4, 1194) = 13.95, p < 0.001, and ηp 2 = 0.05. The other groups did not significantly differ from one another. Overall, group membership accounted for 20% of the variance in pronoun usage. See Table 3 for all Slatcher et al. comparisons.
Note: Different superscripts indicate statistically significant differences at p < 0.05 using Scheffe's post hoc analysis. In other words, means that have the same superscript did not significantly differ.
Discussion
Overall, the results show that pronoun use tends to differ among CDSOs, decoys, and romantic couples. Specifically, CDSOs use more you words than decoys and romantic couples, while I and we differences were less consistent. CDSOs try to portray trust and warmth when they interact with children, 19 which can come in many forms, such as compliments. However, it is hard to say how CDSOs might choose to convey this “warmth” because one CDSO may choose to compliment the child's physical features, while another might choose to talk about how he will lavish the child with gifts.
It was hypothesized that this warmth would come across through the use of we as a result of the CDSO trying to portray closeness, but the results only provided partial support. What was clear was that CDSOs used significantly more you pronouns when talking to the decoys, suggesting that they were more focused on the decoy than themselves. This emphasis could be a symptom of trust building, which is an integral part of luring, 6 and a straightforward way of conveying interest and attention to the child.
Our results provide some evidence that CDSOs have similar pronoun use to romantic couples, but perhaps more interesting is how they are distinct. CDSOs tend to use more second-person pronouns than romantic partners or child decoys. This difference suggests that they are more other-oriented in their focus. They attempt to push the focus back onto the child either because they themselves want to focus more on the child or they want to avoid having the focus on them.
Focusing more on the child could aid the CDSO in portraying care or concern (e.g., “you are better than that” or “you are so cute”) and result in the child being more receptive of them. This pattern was also observed in Black et al.'s analysis when they found that almost all CDSOs used flirtation as a seduction device. 5 Similarly, CDSOs used fewer I than decoys and the romantic partners in Kurlak's dataset. This could be because CDSOs are trying to shift the focus off of themselves to the child. If the conversation is more focused on the CDSO, children may begin to distance themselves because they feel that the CDSO is not paying attention to them.
Although CDSOs use you consistently more than all of the groups in both datasets, their use of we was not as consistent. This can be accounted for by the designs of the two experiments, where Kurlak's participants were instructed to discuss their relationship, while Slatcher's participants merely provided their daily correspondence with their relational partner. Therefore, it is not surprising that couples in the Kurlak's study had significantly more we usage because they were actively discussing their relationships (e.g., “
There are two limitations that should be noted. First, the comparison data were only meant to test if CDSOs' language differed from romantic couples' language. Future comparisons should be made with adults actively engaged in courtship initiation (e.g., online dating) to get a clearer picture of how CDSOs' differ from nondeviant counterparts engaged in flirting. Second, our corpus comprised CDSOs caught by police, whose luring techniques may differ from those not convicted. However, currently there is no feasible way to obtain data from CDSOs who have not been caught.
The linguistic tendencies of CDSOs have recently become of interest to researchers because of advancements of computational analyses of language. This area of inquiry is important because in the past, researchers and practitioners analyzed the motivations and grooming strategies of sexual offenders through observation and clinical interactions. Nuances in their language use revealed by computational analyses can enable us to better understand these motivations and strategies. The current study contributes to this understanding by comparing CDSO pronoun use to nondeviant romantic interactions. Understanding that child sexual offenders speak to their targets in a more other-oriented tone compared with couples provides initial evidence that their language shows unique patterns. Understanding these patterns can help clinicians and practitioners understand these individuals and their motivations on a deeper psychological level, and future researchers can build upon this study in pursuit of accurate ways of identifying predatory language online.
Footnotes
Acknowledgments
We thank Rebecca Kurlak and Richard Slatcher for allowing us to use their data in our comparisons.
Author Disclosure Statement
No competing financial interests exist.
