Abstract
Eyewitness testimony forms an important component in deciding whether a case can be prosecuted. Yet, many criminal perpetrators deliberately conceal their faces with disguises or under dim lighting, undermining eyewitness accuracy. This article reviews recent studies to characterize the factors that impair face recognition performance, specifically, various forms of face disguise (e.g., face masks, sunglasses) and different lighting conditions. Research shows that identification accuracy, alongside eyewitness confidence and decision bias, all affect the reliability of eyewitness accounts. A consistent finding across studies is that face-identification accuracy can be improved by matching the viewing conditions during the police lineup with those during the crime (e.g., showing masked faces during the lineup should the perpetrator be masked). Current face recognition research provides specific recommendations for optimizing the procedures in eyewitness testimony.
Social Media Post
Eyewitness identification of faces is often undermined by dim lighting and disguises, such as wearing sunglasses or face masks. Reliability of eyewitness testimony can improve by matching lighting/disguise conditions in police lineups with those during crimes.
Highlights
Eyewitness testimony is important for criminal case prosecution. Yet, perpetrators often deliberately conceal their faces with disguises or act under dim lighting to avoid being recognized, undermining eyewitness accuracy.
Low lighting, change in lighting direction, covering the eyes (e.g., using sunglasses), and covering the lower face regions (e.g., with face masks) all impair face recognition performance.
The observer’s ability to accurately reidentify faces improves when the lighting or disguise conditions in the memory test match how these faces are first viewed (e.g., faces are reidentified better with masks than without when observers first view masked faces). This is consistent with the encoding specificity principle in memory research.
Accuracy of eyewitness testimony is improved by designing police lineups to match lighting/disguise conditions with those during crimes. This can be implemented by legislative codification of rules or practice guidelines.
Authorities should be aware that eyewitness reliability may also be influenced by the confidence and bias of the eyewitness, contributing to misidentification and wrongful convictions. However, research in this area in relation to lighting and disguise is scarce, requiring further investigation.
The courts should be properly informed of the conditions in which the reliability of eyewitness accounts can be compromised, especially when lighting and disguises are in the equation. A rule should be introduced allowing the court or the accused to appoint an expert to fulfill this purpose.
Crucial to criminal proceedings, the reliability of eyewitness testimony can matter the most in the context of perpetrator identification. This paper looks into how face memory (the main basis for eyewitness identification) may be influenced by lighting and disguises, according to current scientific accounts. Also discussed are confidence and bias in face recognition response, in addition to accuracy and sensitivity; all these may affect the reliability of eyewitness identification. By being more mindful of how these factors may contribute to misidentification, the odds of wrongful convictions can be reduced.
Measures of Face Recognition and Relation to Eyewitness Testimony
Signal detection theory (SDT) is the commonly used framework for measuring face memory. It provides a more accurate estimate of an observer's ability to remember faces without the influence of one's bias in response.
A Preliminary Note on the SDT
The SDT (Green & Swets, 1966; Macmillan & Creelman, 2005) describes the psychological processes underlying a wide range of phenomena in perception and cognition, especially in memory. For example, when remembering faces, memory tests often require an observer to distinguish seen from unseen faces. Every response in such memory tests is, in fact, influenced by both the observer's memory ability (known as
The key advantage of using SDT is that it allows researchers to estimate sensitivity and bias separately based on multiple memory responses, because SDT considers the two as independent psychological processes. Under the SDT framework, responses to a seen-versus-unseen face memory task can be categorized into four possible outcomes:
Counting the occurrence of each outcome enables researchers to separate sensitivity and bias. For example, high sensitivity is characterized by high hit and low false alarm rates, while a strong “seen” bias is characterized by high hit and high false alarm rates.
Characterizing Performance by Sensitivity and Bias
Face memory performance is commonly measured using
Thus, reporting accuracy alone may not provide sufficient information in understanding the observer's perceptual and decision-making processes. This is crucial to the reliability of eyewitness testimony. For instance, a biased memory response could lead to misidentification of an innocent person as the perpetrator (or the perpetrator being set free in the opposite direction of bias). Unfortunately, many studies focused solely on accuracy. While some reported sensitivity values, few reported response biases.
Confidence, Accuracy, and Eyewitness Testimony
Beyond sensitivity, bias, and accuracy, another important measure is
In the context of eyewitness testimony, such a confidence–accuracy relationship depends on multiple conditions. An extensive review found a strong relationship (positive correlation) between eyewitness confidence and identification accuracy when police lineup conditions are appropriate or “pristine,” for example, by including only one suspect per lineup, preventing the suspect from standing out in the lineup, using double-blind testing, etc. (Wixted & Wells, 2017). However, the review highlights that a confidence–accuracy relationship depends on the lineup condition, as nonpristine conditions can compromise this relationship. Thus, the review recommends adopting these pristine lineup conditions and avoiding nonpristine ones.
Individual differences can also affect the confidence–accuracy relationship. An online, mock-crime study shows that the probability of making a high-confidence error depends on a collection of factors. This includes decision time, eyewitness's general ability to remember faces, and the type of justification given after choosing the suspect in the face-identification task (Grabman et al., 2019). As these factors are largely related to how well faces are represented in memory, the study's findings, in general, support the importance of face-representation quality: The better a face image is represented in the brain, the more likely the observer is going to be accurate when giving a high-confidence response.
Accuracy, confidence, and, critically, the relationship between the two depend on many other factors. Lighting and disguises are two important factors that have rarely been examined in previous research. Nevertheless, this is an important future direction for both experimental and applied research, as it bridges the theories (e.g., face perception and memory) to real-world applications (e.g., eyewitness testimony and lineups).
Effect of Lighting and Disguise on Face Recognition
The following sections will review the effects of lighting and disguises on face recognition performance and discuss their effects on accuracy, sensitivity, confidence, and bias, particularly in relation to designing eyewitness lineups. Most studies involve laboratory experiments on face recognition aimed at understanding the basic science of face perception. Some are applied studies, aimed specifically at improving eyewitness reliability. Most of these basic science studies are
Lighting
Half of the crimes occur in conditions of low lighting (Felson & Poulsen, 2003). When lighting level drops from daylight to starlight conditions, humans’ visual acuity falls sharply (Ferwerda, 1998). This calls into question the accuracy of eyewitness identification in the context of such adverse viewing conditions, as several studies suggest that low lighting levels negatively impact face recognition performance (e.g., DiNardo & Rainey, 1989; 1991; Nyman et al., 2019; Yarmey, 1986). The following sections review the effects of changing uniform or nonuniform lighting. When the same lighting level is applied to the entire face image, the lighting is
Uniform Lighting: Dim versus Bright
Face recognition and memory are, in general, impaired under uniformly dim lighting. Early face memory research reported that dimmer lighting led to lower accuracy and confidence in identifying faces than did brighter lighting (DiNardo & Rainey, 1989; Yarmey, 1986). More recently (Nyman et al., 2019), a simulation of eyewitness lineups across three broad-range lighting levels (0.7–300 lx; simulating starlight conditions to a brightly lit office space) found face recognition in lower lighting conditions to be significantly less accurate, alongside a slower response and lower confidence. Across all lighting levels, higher confidence was associated with higher accuracy, demonstrating a positive confidence–accuracy relationship. Interestingly, accurate responses were faster only under higher lighting levels. The poorer face recognition performance under dimmer lighting can be attributed to a reduction in the amount of available information being extracted from dimmer faces, following observations from the impact of dim lighting on nonface picture recognition (e.g., Loftus, 1985).
Nonuniform Lighting: Direction of Lighting and Cast Shadow
Although research on nonuniform lighting has been limited, some evidence indicates that changes in lighting direction impair face recognition. For instance, face matching performance was found to be poorer when matching a top-illuminated face with a bottom-illuminated face, than when matching two faces illuminated from the same direction (Hill & Bruce, 1996). In a series of face memory and face matching tasks (Braje, 2003; Braje et al., 1998), changes in lighting direction also led to lower sensitivity and slower responses. Cast shadows in these studies did not seem to affect sensitivity, though sometimes led to slower responses in face recognition. However, cast shadows have been found to improve recognition performance for nonface objects (Castiello, 2001; Tarr et al., 1998). More research is needed to understand the effect of nonuniform lighting on face recognition.
Congruence in Lighting Between Face Memory Encoding and Retrieval
Face memory performance is generally better when uniform lighting levels match between study and test, but performance is impaired when they do not match. For instance, when bright lighting was used during the face memory test, face recognition performance was poorer when faces were initially studied under dim lighting than under bright lighting (De Jong et al., 2005; Wagenaar & Van der Schrier, 1996). However, these studies did not present faces under dim lighting during the face memory test. Thus, it was unclear whether police lineups should be set in dim lighting in order to match the lighting of crimes happening in the dark.
More recently, another study systematically examined how changes in lighting between study and test affected face memory (Lim et al., 2022). Lighting was
These findings are consistent with the
In contrast, an earlier study on the effect of uniform lighting changes suggested that brighter lighting at either encoding or retrieval stages resulted in higher sensitivity, with similar bias levels across conditions (DiNardo & Rainey, 1991). These findings run contrary to the encoding specificity principle, though it can be explained by the narrow range of lighting levels, both of which were too dim. For instance, their “bright” conditions still fell within the range of lighting on a full moon night.
Research on the effect of nonuniform lighting on face recognition also supports the encoding specificity principle, where congruent lighting directions between study and test resulted in better face recognition performance than incongruent lighting directions (Braje 2003; Braje et al., 1998; Hill & Bruce, 1996).
Nevertheless, research on the effects of lighting on face recognition remains scarce. It is especially unclear how the brain processes facial information under extremely low lighting levels, as in some possible crime scenes. More research in this area is needed.
Disguise
A face disguise covers or modifies parts of the face to alter perception of one's identity, which often impairs face recognition. However, the degree of impairment depends on the disguised facial regions. Given the prominent role of the eyes (within the upper region) in face recognition and the renewed interests in the effects of sanitary masks (which occlude the lower facial region) following the COVID-19 pandemic, this section focuses specifically on disguises that affect either the upper or lower facial region.
Previous studies on disguises typically applied realistic disguises (e.g., sunglasses, sanitary masks) or occlusions with image-editing techniques (e.g., blackout of the eyes). Most of these studies investigated the disguise effects in more artificial lab settings (e.g., computerized test), though a handful attempted to simulate real-world conditions (e.g., police lineups) to yield more generalizable results.
The Upper Facial Region: Eyes
Research points to the essential role of the eye region in providing major diagnostic information for face recognition. For example, face-selective neurons were found to be especially sensitive to the eyes (Freiwald, Tsao & Livingstone, 2009; Issa & DiCarlo, 2012; Nemrodov et al., 2014; Nemrodov & Itier, 2011; Schyns, Bonnar & Gosselin, 2002; Vinette, Gosselin & Schyns, 2004). Humans often direct their gaze firstly to the eyes, which have been shown to maximize information retrieval from faces (Barton et al., 2006; Henderson, Williams & Falk, 2005; Or, Peterson & Eckstein, 2015; Peterson & Eckstein, 2012), allowing faces to be recognized within the first couple of eye movements (Hsiao & Cottrell, 2008). Face recognition also improves when the eye region information is effectively used (Hills, Ross & Lewis, 2011; Royer et al., 2018; Sekiguchi, 2011).
As a result, occluding the upper facial region generally impairs face recognition. Earlier studies, which digitally removed or occluded the eyes, found various kinds of impairment on face memory, including increased recognition error and looking time (McKelvie, 1976), slowing of familiarity judgment on celebrity faces (Roberts & Bruce, 1988), or a decrease in its accuracy (Sadr, Jarudi & Sinha, 2003). The eyebrows (in addition to the eyes) have been found to be crucial in familiarity judgment (Sadr et al., 2003).
Later studies used real-life images or more advanced photo-editing techniques to introduce more realistic disguises, though results generally remained the same in that face recognition is impaired with upper region disguises. For instance, sunglasses impair face memory (Hockley et al., 1999; Nguyen & Pezdek, 2017, Or et al., 2023), regardless of the race of the face image (Nguyen & Pezdek, 2017). Sunglasses also impair face matching performance, even for “super-recognizers” (e.g., Ramon, 2021) with above-average accuracy in face recognition (Bennetts et al., 2022; Graham & Ritchie, 2019; Noyes et al., 2021). Regular glasses, which do not occlude but modify the appearance of the upper face region, can also lead to poorer face memory (Righi, Peissig & Tarr, 2012; Terry, 1993; 1994) and face matching performance, especially when a face with glasses is matched with one without (Graham & Ritchie, 2019; Kramer & Ritchie, 2016). Face matching performance also declined when matching a face wearing sunglasses with one wearing regular glasses (Graham & Ritchie, 2019).
Some studies show that more disguises on the upper facial region lead to greater impairment on face recognition. For instance, faces with more upper area covered (e.g., with sunglasses and toques, or plain stockings down to the nose) led to poorer performance and lower confidence in face recognition than did faces with a single type of disguise in simulated police lineups (Mansour et al., 2020). Changes in hairstyles (e.g., with wigs), in addition to having glasses, also impair face memory (Righi et al., 2012).
The Lower Facial Region: Nose and Mouth
Research following the COVID-pandemic has agreed that a sanitary face mask, which covers the mouth and part of the nose, impairs face recognition in face memory tests (Freud et al., 2020; 2022; Garcia-Marques, Oliveira & Nunes, 2022; Guerra et al., 2022; Hsiao, Liao & Tso, 2022; Marini et al., 2021; Or et al., 2023; Stajduhar et al., 2022) and face matching tasks (Bennetts et al., 2022; Carragher & Hancock, 2020; Estudillo, Hills & Wong, 2021; Noyes et al., 2021). The impairment is reflected by lower accuracy, lower sensitivity, and/or a larger bias. Despite extensive exposure due to pandemic requirements, accuracy in recognizing masked faces in face memory tests has not improved over the years (Freud et al., 2022). In contrast, performance for matching masked faces may be improved by diagnostic feature training (Carragher et al., 2022).
Other Forms of Disguises
A limited number of studies also suggest that face recognition may be impaired by occluding the head outline (e.g., headscarf: Megreya, Memon & Havard, 2012; Toseeb, Keeble & Bryant, 2012; Wang et al., 2015), disguising the entire face by a patterned/plain stocking (Davies & Flin, 1984; Mansour et al., 2020), or adding facial hair (beard/mustache: Dhamecha et al., 2014; Foley & Foley, 1998; Terry, 1994). Their impacts on eyewitness identification remain to be understood.
Comparing Disguises on the Upper and Lower Facial Regions
Although research suggests that both upper and lower region disguises impair face recognition, it remains unclear which area of disguise has a larger impact. Early studies seem to suggest that occluding the lower facial region does not impair face recognition as much as occluding the upper region does. For instance, blacking out the mouth resulted in fewer recognition errors than blacking out the eyes did in face memory tests, though both led to similar confidence ratings (McKelvie, 1976). These results suggest that accuracy and confidence may not always correlate. Blacking out the lower region also does not slow down familiarity judgment (Roberts & Bruce, 1988), but blocking the lower head outline and features down from nostrils (e.g., using bandanas) seems to impact face memory more for African Americans than Caucasian Americans (Nguyen & Pezdek, 2017).
However, more recent studies comparing the effects of sunglasses and sanitary masks showed inconsistent results. For example, one study on average observers and super-recognizers found poorer face matching performance for masked faces than faces with sunglasses (Noyes et al., 2021), whereas another study found comparable performances (Bennetts et al., 2022). Face memory has worsened less with masks than sunglasses, to the extent that sensitivity only declined under incongruent masking (Or et al., 2023). This is consistent with early studies suggesting a smaller impact from occluding the lower facial region than the upper region. Nevertheless, future studies should investigate further the relative impacts of upper and lower region disguises and their implications on eyewitness identification.
Congruence in Disguise and Encoding Specificity
Many face recognition studies have investigated the effects of congruence in disguise across memory stages. In general, when disguises are
Incongruent Disguises Lower Accuracy?
Face memory accuracy, in general, declines more under incongruent than congruent disguises, regardless of disguise in the upper or lower facial region (masking eyes or mouth: McKelvie, 1976; ski masks: Manley, Chan & Wells, 2019; sanitary masks: Garcia-Marques et al., 2022; Hsiao et al., 2022; Marini et al., 2021; sunglasses or sanitary masks: Or et al., 2023; and regular glasses: Righi et al., 2012; Terry, 1993; 1994). These findings suggest that the effects of disguises align with the encoding specificity principle (but see, e.g., Freud et al., 2020 and Guerra et al., 2022, for the lack of encoding specificity), which is further supported by studies using face matching tasks (glasses/sunglasses: Bennetts et al., 2022; Graham & Ritchie, 2019; Kramer & Ritchie, 2016; Noyes et al., 2021).
Incongruent Disguises Lower Confidence?
Observers tend to be less confident when disguises are incongruent between study and test. When disguise is present in only one of the memory stages, confidence is lower than when it is present in both study and test, as shown over time: in early lab studies that blacked out facial features (e.g., McKelvie, 1976), in later studies using realistic disguises such as sunglasses (e.g., Hockley et al., 1999), and in recent studies using mock-lineup configurations (Manley et al., 2019). In a more recent study (Manley, Chan, & Wells, 2022), beyond a general, positive correlation between confidence and accuracy regardless of masking, higher confidence ratings have been found for an all-masked lineup than a no-mask lineup. This finding supports the encoding specificity principle as the perpetrator always wore masks during study (encoding phase). Thus, the encoding specificity principle may also apply to memory confidence.
Incongruent Disguises Affect Response Bias?
In contrast, response bias may be more affected by the presence of disguise itself than congruence. Generally, decisions in face memory tasks become more liberal (i.e., more likely to indicate faces as “seen”) when the face is disguised (Garcia-Marques et al., 2022; Guerra et al., 2022; Or et al., 2023), though the direction of bias (whether conservative or liberal) due to disguise is inconsistent across face matching studies (Bennetts et al., 2022; Graham & Ritchie, 2019; Kramer & Ritchie, 2016). In addition, when a mixed list of masked and unmasked faces were studied, decisions became more liberal (with a lower sensitivity) toward masked faces than unmasked faces in the test (Garcia-Marques et al., 2022). This suggests that biases in face memory tasks can be systematically manipulated based on encoding-retrieval congruence.
Recommendations and Future Directions
Accurately identifying the perpetrator is critical. Any mistakes from eyewitness accounts can potentially lead to wrongful convictions, especially when the court does not apply evidential and procedural rules rigorously (e.g., exclusionary rules of evidence, standards of proof such as requiring independent corroboration), favoring otherwise weak evidence from the prosecutor. Yet, perpetrators would often attempt to conceal their identity with disguises or by acting under dim lighting to avoid being recognized. Thus, subsequent reidentification of faces would no doubt be challenging when first viewed and encoded under such adverse conditions.
A consistent finding across studies on lighting and disguises is the importance of presenting faces under congruent conditions. By matching the lighting conditions or types of disguises, the observer's ability to accurately match or reidentify faces during subsequent testing improves greatly. Conversely, incongruent viewing conditions significantly lower the observer's accuracy, despite any confidence attached to the response. The adversarial tradition of common-law criminal justice systems, in which the court does not play an active role in gathering evidence, needs to be alive to these factors.
As face memory operates largely according to the encoding specificity principle, one possible way to improve the accuracy of eyewitness testimony is to present police lineups in conditions that match how the face of the perpetrator has been encoded by the eyewitness during the time of the crime. For example, should the crime take place at night (or when the perpetrator is disguised), authorities could consider presenting eyewitness lineups in dim lighting (or with similar disguises). This aims at setting up more pristine lineup conditions (Wixted & Wells, 2017), which may benefit eyewitness confidence and identification accuracy.
The recommended mode of implementation is legislative codification of rules or practice guidelines. To complement this (or as an alternative), during the court proceedings, a rule should be introduced such that the court should be properly informed of how the reliability of eyewitness accounts can be materially compromised when lighting and disguises are in the equation. Ideally, the court should appoint an expert to do this methodically through the lens of what has been discussed here, especially if the case turns on the identification of the perpetrator and there is no corroborating evidence. In jurisdictions where the courts are not at liberty to appoint experts, it would be in the interests of the accused to do so.
Footnotes
Acknowledgements
We thank Neva Or for the comments.
Author Note
The authors C.C.-F. Or and A.L.F. Lee are the members of the Vision Sciences Society (VSS).
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by NTU CoHASS Start-Up Grant, CoHASS Incentive Schemes, and Singapore MOE AcRF Tier 1 Grant 2018-T1-001-069 and 2019-T1-001-064 to C.O., 2019-T1-001-060 to C.O. & A.L., and the General Research Fund (Project No. LU13603220) from the Research Grants Council and the Research Matching Grant Scheme (Project No. LU13603220) from the University Grants Committee of the Hong Kong Special Administrative Region, China to A.L.. We thank Neva Or for comments.
