Universal School Readiness Screening at Kindergarten Entry

Abstract

Researchers examined the concurrent and predictive validity of a brief (12-item) teacher-rated school readiness screener, the Kindergarten Student Entrance Profile (KSEP), using receiver operating characteristic (ROC) curve analysis to examine associations of children’s (N = 78) social-emotional (SE) and cognitive (COG) readiness with measures of behavioral/emotional risk and early literacy skills throughout kindergarten. Results indicated statistically significant associations between both subscales of the KSEP (SE and COG) and all outcome variables. Findings provide validity evidence in support of the KSEP as an initial gate in the universal screening process to inform educators about the readiness of incoming kindergarteners.

Keywords

kindergarten readiness, universal screening, KSEP

Kindergarten is the first formal year of schooling for most children, as approximately 98% of children in the United States attend kindergarten at school entry (Zill & West, 2001). Often, kindergarten provides the earliest opportunity to provide services to children in a single setting and is an ideal time to identify and address early emotional, behavioral, and academic difficulties (Feeney-Kettler, Kratochwill, Kaiser, Hemmeter, & Kettler, 2010). Early identification and intervention are critical to the academic and social-emotional (SE) development of children, with documented benefits of early intervention in literacy (Denton, Fletcher, Anthony, & Francis, 2006) and emotional/behavioral functioning (Webster-Stratton & Reid, 2003).

The Kindergarten Student Entrance Profile (KSEP; Quirk, Rebelez, & Furlong, 2014) was developed to provide educators with an efficient universal screening tool for use during the first month of kindergarten. The KSEP assesses cognitive (COG) and SE aspects of kindergarten readiness, which distinguishes it from many screeners that focus on either academic or SE functioning (Quirk, Furlong, Lilles, Felix, & Chin, 2011). Previous research has provided a foundation of psychometric evidence supporting the KSEP. KSEP ratings have been associated with a variety of academic outcomes throughout elementary school (Lilles et al., 2009; Quirk et al., 2011; Quirk, Grimm, Furlong, Nylund-Gibson, & Swami, 2016; Quirk, Nylund-Gibson, & Furlong, 2013). Research has also yielded evidence supporting the two-factor structure of the scale (Quirk et al., 2014), including evidence of invariance across student characteristics (Quirk, Mayworm, Edyburn, & Furlong, 2016; Quirk, Mayworm, Furlong, Grimm, & Rebelez, 2015). However, research has not yet examined whether cut points on the KSEP are useful in predicting problems later in kindergarten.

Sensitivity, specificity, positive predictive values (PPV), and negative predictive values (NPV) merit attention when examining the predictive validity of a screener (Glover & Albers, 2007). Sensitivity is the proportion of students with problems who are detected by the screener (i.e., the true positive rate). Specificity is the proportion of students without problems whom the screener correctly identifies as not at risk (i.e., the true negative rate). PPV refers to the proportion of students with positive screens who actually have problems, and NPV indicates the proportion of students with negative screens who do not have problems.
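For reference, the four indices defined above can be written in terms of the cells of a 2 × 2 screening table. The symbols TP, FP, TN, and FN (counts of true positives, false positives, true negatives, and false negatives) are notation introduced here for illustration and do not appear in the original report.

```latex
\begin{align*}
\text{Sensitivity} &= \frac{TP}{TP + FN}, &
\text{Specificity} &= \frac{TN}{TN + FP}, \\
\text{PPV} &= \frac{TP}{TP + FP}, &
\text{NPV} &= \frac{TN}{TN + FN}.
\end{align*}
```

Sensitivity and specificity condition on the true outcome, whereas PPV and NPV condition on the screening result, which is why the latter two depend on how common the problem is in the screened population.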

To improve interpretability of KSEP ratings and provide foundational psychometric information lacking in its current research base, it is necessary to investigate the concurrent and predictive validity of KSEP ratings with established measures of early literacy and emotional/behavioral functioning. The following research questions were examined:

Research Question 1: How well do KSEP ratings predict emotional/behavioral and academic outcomes based on receiver operating characteristic (ROC) curve analyses?

Research Question 2: Based on the determined cut scores, what are the sensitivity, specificity, PPV, and NPV of the KSEP risk classifications?

Method

Participants

Participants included 78 students (42 males) from four kindergarten classes in a public elementary school in California. When initially screened, 22% (n = 17) of the students were 4 years old and 78% (n = 61) were 5 years old. Participants were 67.9% Latino/a, 19.8% Anglo, 3.7% Asian American, 3.7% multiracial, 2.5% African American, 1.2% Native American, and 1.2% Filipino. Approximately 65% of the students were from families experiencing low income and were eligible for free or reduced-price lunch.

Instruments

School readiness

The KSEP (Quirk et al., 2014) is a brief, 12-item observational rating scale designed to assess children’s SE (six items) and COG (six items) readiness at the time they enter kindergarten. Teachers complete KSEP ratings based on observations and interactions with students during the first month of the academic year on a 4-point scale (1 = not yet, 2 = emerging, 3 = almost mastered, 4 = mastered) using a rubric that provides operational definitions and behavioral exemplars. Average ratings across items on each KSEP subscale ranged from 1 to 4, with an average rating of 4 indicating that a child has “mastered” every item on that subscale. Previous research (Quirk et al., 2014) has supported the two-factor structure of the KSEP and the reliability (SE, α = .88 and COG, α = .81) of KSEP ratings.

Early literacy

The STAR Early Literacy (RenSTAR; Renaissance Learning, 2009) assessed children’s academic risk status, with scores below the 25th percentile indicating risk (Fletcher, Francis, Morris, & Lyon, 2005). RenSTAR is a computer-adaptive curriculum-based measure composed of 30-item tests assessing kindergarten students’ reading readiness skills (e.g., phonemic awareness, vocabulary). Previous research has found evidence supporting the reliability of RenSTAR scores and the validity of their use as a standardized measure of literacy achievement (Renaissance Learning, 2009).

Behavioral/emotional risk

The BASC-2 Behavioral and Emotional Screening System (BESS; Kamphaus & Reynolds, 2007) teacher rating scales (TRS) were used to assess children’s behavioral/emotional risk. The TRS includes ratings on a 4-point scale for each item (1 = never, 2 = sometimes, 3 = often, and 4 = almost always), and item responses provide a T-score with higher item ratings indicating higher risk (Kamphaus & Reynolds, 2007). T-scores between 20 and 60 indicate normal risk, scores between 61 and 70 suggest elevated risk, and scores of 71 or higher are categorized as extremely elevated. To facilitate analyses, each student with a T-score of 61 or higher was classified as having emotional/behavioral risk. Psychometric properties and information on scale development are available in the manual.
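The classification rule described above can be sketched in a few lines of Python. The function names are hypothetical and chosen for illustration; the thresholds are the ones stated in the text, and treating any T-score of 60 or below as normal is an assumption of this sketch.

```python
def bess_risk_level(t_score):
    """Map a BESS teacher-rating T-score to the risk band described in the text."""
    if t_score >= 71:
        return "extremely elevated"
    if t_score >= 61:
        return "elevated"
    # Assumption: any score of 60 or below falls in the normal band.
    return "normal"


def bess_at_risk(t_score):
    """Dichotomize risk as in the study: a T-score of 61 or higher counts as at risk."""
    return t_score >= 61
```

For example, `bess_risk_level(65)` returns `"elevated"`, and `bess_at_risk(60)` returns `False`.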

Procedure

All activities were approved by an institutional review board, which included active consent from all participants. The participating school was selected due to its expressed interest in early screening and an ongoing collaboration with the researchers. All kindergarten teachers at the participating school completed a 2-hr training on how to administer the assessments. Teachers completed KSEP ratings for all students during the first month of kindergarten and BESS ratings for all students at two time points (fall and spring). Students completed the RenSTAR in the fall and spring of kindergarten.

Results and Discussion

First, ROC curve analyses were conducted to assess the area under the curve (AUC) for use of (a) the KSEP SE in predicting BESS risk classification in the fall and spring and (b) the KSEP COG in predicting RenSTAR classifications in the fall and spring. In the fall and spring, 13.9% and 11.3% of students were classified as at risk by the BESS, and 14.9% and 17.8% of students were classified as at risk by the RenSTAR, respectively. As seen in Figure 1, AUCs were 0.79 and 0.80 for prediction of BESS risk classification in the fall and spring, respectively. AUCs were 0.81 and 0.87 for prediction of RenSTAR classification in the fall and spring, respectively (Figure 2). All AUCs were in the fair (.70-.79) to good (.80-.89) range (Youngstrom, 2014). Results indicate that KSEP scale scores performed comparably in predicting behavioral/emotional and academic risk, both concurrently and longitudinally.
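To make the AUC concrete, the following minimal sketch computes it as the probability that a randomly chosen at-risk child has a lower screener score than a randomly chosen child who is not at risk, with ties counted as one half. This pairwise formulation is a standard equivalent of the area under the ROC curve; it is not necessarily how the analyses reported here were run, and the function name and example values are hypothetical rather than study data.

```python
def auc_low_score_is_risk(scores, at_risk):
    """AUC for a screener on which LOWER scores signal risk.

    Compares every at-risk child with every not-at-risk child and counts
    the share of pairs in which the at-risk child scored lower
    (ties contribute one half).
    """
    risk = [s for s, r in zip(scores, at_risk) if r]
    no_risk = [s for s, r in zip(scores, at_risk) if not r]
    if not risk or not no_risk:
        raise ValueError("need at least one child in each outcome group")
    wins = 0.0
    for a in risk:
        for b in no_risk:
            if a < b:
                wins += 1.0
            elif a == b:
                wins += 0.5
    return wins / (len(risk) * len(no_risk))


# Hypothetical subscale averages (1-4 scale) and outcome flags, not study data.
example_scores = [2.0, 2.5, 3.0, 3.5, 3.5, 4.0]
example_at_risk = [True, True, False, True, False, False]
example_auc = auc_low_score_is_risk(example_scores, example_at_risk)
```

An AUC of .50 corresponds to chance-level discrimination and 1.00 to perfect separation of the two groups.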

Figure 1.

ROC Curve analysis with KSEP SE scores predicting (a) fall and (b) spring BESS classification.

Figure 2.

ROC Curve analysis with KSEP COG scores predicting (a) fall and (b) spring RenSTAR classification.

Second, ROC curve analyses were used to determine optimal cut scores for risk on each of the KSEP domains. The pivot point was identified based on a leveling off of sensitivity accompanied by a decrease in specificity across the cut scores. This is depicted by the flattening of the ROC curve at a sensitivity of approximately 0.80 in both panels of Figure 1. The goal in choosing a cut score was to find the point at which sensitivity and specificity were both as close to .75 or higher as possible, which is considered desirable (Glover & Albers, 2007; Levitt, Saka, Romanelli, & Hoagwood, 2007). However, what is seen as desirable varies across instruments and uses. If, for example, the consequences of a false positive are worse than the consequences of a false negative, then a lower NPV may be an acceptable trade-off for a higher PPV. Based on examinations of sensitivity and specificity in predicting the outcomes of interest, cut scores between 3.25 and 3.40 emerged as the pivot point for each comparison. As the 40th percentile was 3.33 for both KSEP domains and was situated in the middle of the target range, 3.33 was chosen as the cut score for risk status for both KSEP subscales (SE and COG). These results are consistent with previous research indicating children rated in these ranges have a stronger likelihood of academic success across the elementary grades (Quirk et al., 2011; Quirk et al., 2016).
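The cut-score search described above can be illustrated with a short sketch that tabulates sensitivity and specificity at several candidate cut scores and marks those meeting the .75 target on both indices. The function names and toy data are hypothetical, and the rule that scores strictly below the cut count as a positive screen is an assumption, since the report does not state how a score exactly at the cut was handled.

```python
def sens_spec_at_cut(scores, at_risk, cut):
    """Sensitivity and specificity when scores below `cut` screen positive."""
    tp = fn = tn = fp = 0
    for score, risk in zip(scores, at_risk):
        flagged = score < cut  # assumption: strictly below the cut is a positive screen
        if risk and flagged:
            tp += 1
        elif risk:
            fn += 1
        elif flagged:
            fp += 1
        else:
            tn += 1
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one child in each outcome group")
    return tp / (tp + fn), tn / (tn + fp)


def scan_cuts(scores, at_risk, cuts, target=0.75):
    """Return (cut, sensitivity, specificity, meets_target) for each candidate cut."""
    table = []
    for cut in cuts:
        sens, spec = sens_spec_at_cut(scores, at_risk, cut)
        table.append((cut, sens, spec, sens >= target and spec >= target))
    return table


# Hypothetical subscale averages and outcome flags, not study data.
toy_scores = [2.0, 2.5, 3.0, 3.2, 3.4, 3.6, 3.8, 4.0]
toy_at_risk = [True, True, False, True, False, False, False, False]
toy_table = scan_cuts(toy_scores, toy_at_risk, cuts=[3.0, 3.33, 3.7])
```

In this toy data, raising the cut from 3.0 to 3.33 lifts sensitivity to 1.0 while specificity stays at .80; raising it further to 3.7 adds no sensitivity and drops specificity to .40, which mirrors the leveling-off pattern used to locate the pivot point.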

Sensitivity, specificity, PPV, and NPV estimates were calculated using 3.33 as the cut score for risk on the KSEP SE and COG to predict risk status on the BESS and RenSTAR, respectively. Results in Table 1 indicate that sensitivity and NPV were strengths of the KSEP SE and COG risk scores. Sensitivity and PPV estimates are often considered the most important within a screening context, as low sensitivity values may lead to under-identification of at-risk students, and low PPVs indicate a greater chance of over-identifying students as at risk (Glover & Albers, 2007). Sensitivity estimates suggest that the KSEP identifies most students presenting with social/emotional and academic problems. However, low PPV estimates indicate that additional assessment, such as that proposed within multiple-gating screening frameworks (Glover & Albers, 2007), could provide further differentiation between students who do and do not present with behavioral/emotional or academic problems.

Table 1.

Sensitivity, Specificity, PPV, and NPV Estimates.

Outcome	KSEP subscale	Sensitivity	Specificity	PPV	NPV
BESS fall	SE	.70	.68	.26	.93
BESS spring	SE	.86	.64	.23	.97
RenSTAR fall	COG	.70	.77	.35	.94
RenSTAR spring	COG	.84	.77	.44	.96

Note. KSEP = Kindergarten Student Entrance Profile; SE = social-emotional; COG = cognitive; PPV = positive predictive value; NPV = negative predictive value; BESS = Behavioral and Emotional Screening System; RenSTAR = STAR Early Literacy.
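The combination of high NPV and low PPV in Table 1 follows from the low base rates of risk in the sample. As a rough check, the sketch below derives PPV and NPV from sensitivity, specificity, and base rate using Bayes' rule. The function name is hypothetical, and the study's estimates were presumably computed from raw classification counts rather than this way; the inputs are the rounded fall BESS values reported in the text and Table 1.

```python
def predictive_values(sensitivity, specificity, base_rate):
    """PPV and NPV implied by sensitivity, specificity, and outcome base rate."""
    tp = sensitivity * base_rate               # at risk and screened positive
    fn = (1 - sensitivity) * base_rate         # at risk but screened negative
    tn = specificity * (1 - base_rate)         # not at risk and screened negative
    fp = (1 - specificity) * (1 - base_rate)   # not at risk but screened positive
    return tp / (tp + fp), tn / (tn + fn)


# Fall BESS: sensitivity .70, specificity .68, and 13.9% of students at risk.
fall_bess_ppv, fall_bess_npv = predictive_values(0.70, 0.68, 0.139)
```

With these inputs the function returns approximately .26 and .93, in line with the fall BESS row of Table 1. Because fewer than one in five students was at risk on any outcome, even a moderate false positive rate yields more false positives than true positives, which keeps PPV low.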

Finally, KSEP subscale scores were correlated with BESS and RenSTAR scores in the fall (concurrent) and spring (predictive). Table 2 presents these coefficients, all of which were in the anticipated direction and ranged in absolute value from .60 to .74.

Table 2.

Descriptive Statistics and Concurrent and Predictive Validity Coefficients.

Measure	M	SD	Correlation with KSEP SE	Correlation with KSEP COG
KSEP SE	19.80	3.84
KSEP COG	19.84	4.33
Concurrent
BESS (fall)	47.65	10.89	−.74
RenSTAR (fall)	553.09	112.92		.60
Predictive
BESS (spring)	46.29	9.64	−.71
RenSTAR (spring)	608.01	103.71		.61

Note. All correlations are significant at p < .001. KSEP = Kindergarten Student Entrance Profile; SE = social-emotional; COG = cognitive; BESS = Behavioral and Emotional Screening System; RenSTAR = STAR Early Literacy.

Several limitations deserve mention. First, generalizability is limited due to the size and characteristics of the sample. In addition, data were collected over a single academic year and mostly via teacher report. Future research should examine how KSEP ratings predict a variety of academic and SE outcomes using multiple informants and measures and investigate how KSEP data can be utilized to support further assessment and early intervention efforts.

Efforts to proactively screen and provide early intervention for students are increasing (Glover & Albers, 2007). However, screening efforts are limited by the psychometric properties of the instruments used. Despite limitations, this study provides evidence of adequate concurrent and predictive validity estimates and supports the use of a 3.33 cut score to optimize predictive validity estimates for KSEP subscales. Results also suggest that the KSEP could be a valuable resource as an initial gate in the universal screening of students at kindergarten entry.

Footnotes

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported in part by the Society for the Study of School Psychology.

References

Denton, C. A., Fletcher, J. M., Anthony, J. L., & Francis, D. J. (2006). An evaluation of intensive intervention for students with persistent reading difficulties. Journal of Learning Disabilities, 39, 447-466. doi:10.1177/00222194060390050601

Feeney-Kettler, K. A., Kratochwill, T. R., Kaiser, A. P., Hemmeter, M. L., & Kettler, R. J. (2010). Screening young children’s risk for mental health problems: A review of four measures. Assessment for Effective Intervention, 35, 218-230. doi:10.1177/1534508410380557

Fletcher, J. M., Francis, D. J., Morris, R. D., & Lyon, G. R. (2005). Evidence-based assessment of learning disabilities in children and adolescents. Journal of Clinical Child & Adolescent Psychology, 34, 506-522. doi:10.1207/s15374424jccp3403_7

Glover, T. A., & Albers, C. A. (2007). Considerations for evaluating universal screening assessments. Journal of School Psychology, 45, 117-135. doi:10.1016/j.jsp.2006.05.005

Kamphaus, R. W., & Reynolds, C. R. (2007). BASC-2 Behavioral and Emotional Screening System manual. Circle Pines, MN: Pearson.

Levitt, J. M., Saka, Romanelli, L. H., & Hoagwood (2007). Early identification of mental health problems in schools: The status of instrumentation. Journal of School Psychology, 45, 163-191. doi:10.1016/j.jsp.2006.11.005

Lilles, Furlong, Quirk, Felix, Dominguez, & Anderson (2009). Preliminary development of the Kindergarten Student Entrance Profile. The California School Psychologist, 14, 71-80. doi:10.1007/BF03340952

Quirk, Furlong, M. J., Lilles, Felix, & Chin (2011). Preliminary development of a kindergarten school readiness assessment for Latino students. Journal of Applied School Psychology, 27, 77-102. doi:10.1080/15377903.2010.540518

Quirk, Grimm, Furlong, Nylund-Gibson, & Swami (2016). The association of Latino children’s kindergarten school readiness profiles with Grade 2-5 literacy achievement trajectories. Journal of Educational Psychology, 108, 814-829.

Quirk, Mayworm, Edyburn, & Furlong (2016). Dimensionality and measurement invariance of a school readiness screener by ethnicity and home language. Psychology in the Schools, 53, 772-784. doi:10.1002/pits.21935

Quirk, Mayworm, Furlong, Grimm, & Rebelez (2015). Dimensionality and measurement invariance of a school readiness screener by gender and parent education levels. International Journal of School and Educational Psychology, 3, 167-177. doi:10.1080/21683603.2015.1053644

Quirk, Nylund-Gibson, & Furlong (2013). Exploring patterns of Latino/a children’s school readiness at kindergarten entry and their relations with grade 2 achievement. Early Childhood Research Quarterly, 28, 437-449. doi:10.1016/j.ecresq.2012.11.002

Quirk, Rebelez, & Furlong (2014). Exploring the dimensionality of a brief school readiness screener for use with Latino/a children. Journal of Psychoeducational Assessment, 32, 259-264. doi:10.1177/0734282913505994

Renaissance Learning. (2009). STAR early literacy. Wisconsin Rapids, WI: Author.

Webster-Stratton, & Reid, M. J. (2003). Treating conduct problems and strengthening social and emotional competence in young children: The Dina Dinosaur treatment program. Journal of Emotional and Behavioral Disorders, 11, 130-143. doi:10.1177/10634266030110030101

Youngstrom, E. A. (2014). A primer on receiver operating characteristic analysis and diagnostic efficiency statistics for pediatric psychology: We are ready to ROC. Journal of Pediatric Psychology, 39, 204-221. doi:10.1093/jpepsy/jst062

Zill, & West (2001). Findings from the condition of education 2000: Entering kindergarten. Washington, DC: National Center for Education Statistics.