Effect of the Alvarado score on the diagnostic accuracy of right iliac fossa pain in an emergency

Abstract

Acute appendicitis is among the commonest surgical diseases faced by an emergency clinician. A prompt diagnosis is mandatory to reduce morbidity and mortality, which may be difficult for junior doctors who first see the patient in the emergency department. This study was conducted to assess the efficacy of Alvarado score as a tool to improve the accuracy of diagnosis of appendicitis. We did a prospective pre- and post-test study in 381 patients with right iliac fossa pain in the emergency department of BP Koirala Institute of Health Sciences. Alvarado scoring was used for the post-test study. We found that the Alvarado scoring system is an effective diagnostic method, which decreases overdiagnosis of appendicitis in the emergency department, and at the same time reduces the risk of missing acute appendicitis cases. It is inexpensive and easy to implement by junior doctors with good results.

Introduction

Junior house officers with less than 12 months' experience still staff the emergency departments of hospitals in most parts of Nepal. Delays in diagnosis or a wrong diagnosis can lead to problems in the management of an emergency. Appendicitis is a typical emergency condition that can be used to flag the accuracy of the diagnosis of the emergency department staff, as many patients present atypically and it is, therefore, imperative to diagnose and manage them early in order to avoid a high rate of complications.

In 1986, Alvarado developed a scoring system based on the important clinical parameters of acute appendicitis¹ and several studies have subsequently reported the importance of this scoring system in the diagnosis of appendicitis and also in order to exclude other differential diagnoses of right iliac fossa pain. However, the usefulness of this diagnostic method in the hands of inexperienced doctors in hospitals in developing countries has not so far been assessed.

The present study examines the way in which the application of the Alvarado scoring system has affected the diagnosis of acute appendicitis and the subsequent correlation with the clinical, operative and histopathological diagnoses.

Patients and methods

The term ‘pre-test’ was used to cover the ability of doctors to diagnose appendicitis before the scoring system had been introduced; ‘post-test’ covers their ability after the introduction of the scoring system.

The Alvarado score was introduced to the junior doctors after 100 patients with right iliac fossa pain underwent appendicectomy. They were shown how to complete the proforma. When they had been trained, the following 100 cases undergoing appendicectomy were recorded. The diagnosis of acute appendicitis using the Alvarado score was correlated with peroperative findings, histopathology and the diagnosis at the time of discharge.

Two sets of questionnaires were used by junior doctors during the emergency, one for pre- and the other for post-test. The variables studied were: name; sex; age; date; and initial diagnosis after the evaluation of the clinical features. The post-test questionnaire also recorded the clinical signs, symptoms and laboratory parameters used to calculate the Alvarado score (Table 1). The diagnoses made during the emergency were then compared with the final diagnoses which had been made based on the peroperative findings of those cases that subsequently underwent appendicectomy, histopathological findings and the final diagnosis on discharge by the operating surgeon. An appendix which was either inflammed with features of oedema, ulceration, gangren, or grossly inflammed cells with areas of necrosis microscopically was considered as appendicitis.²

Table 1

Alvarado scoring system

		Score
Features
Symptoms	Migratory right iliac fossa pain	1
	Nausea/vomiting	1
	Anorexia	1
Signs	Right iliac fossa tenderness	2
	Fever >37.30°C	1
	Rebound pain in right iliac fossa	1
Laboratory test	Leucocytosis (>10 × 109/L)	2
	Neutrophilic shift to the left >75%	1
Total score		10

Statistical methods

The difference in diagnosis in pre- and post-test groups was analysed by Epi Info 2000 using 2x2 tables for sensitivity, specificity, positive predictive value and negative predictive value.

Results

The total number of patients in the study were 381, including 210 cases for pre-test analysis and 171 cases for post-test analysis.

Pre-test analysis

An analysis was performed on 210 patients with right iliac fossa pain. The youngest patient was 6 years old and the oldest was 84. The mean age of patients in this group was 30.8. The majority of patients were aged between 11 and 40 peaking in the 21–30 age group. There were more males than females in the sample.

Of 210 patients, the initial diagnosis of appendicitis was made in 85 (40.4%), possible cases of appendicitis in 96 (45.8%) and some other diagnoses in 29 (13.8%). In the initial appendicitis diagnosis group, the final diagnosis of appendicitis was made for 56 (65.8%) and another diagnosis in 29 (34.2%). For the group who were diagnosed as possibly having appendicitis, 50 (52.1%) did have appendicitis and 46 (47.9%) did not. Of the group who were diagnosed as not having appendicitis, the final diagnosis of appendicitis was made in seven (24.1%).

In the pre-test group, who did not use the Alvarado score, the sensitivity of the duty medical officer and the diagnostic accuracy for ‘definite sure appendicitis’ was 49.6% (95% confidence interval [CI]: 40.5%–58.6%), specificity 70.1% (95% CI: 60.4%–78.3%), positive predictive value 65.9% (95% CI: 55.3%–75.1%) and negative predictive value 54.4% (95% CI: 45.7%–62.9%); the percentage for accurate diagnoses was 59.04%.

Post-test analysis

An analysis was performed on 171 patients of right iliac fossa pain. The clinical parameters included in the Alvarado scoring system were taken into consideration. The youngest patient in this group was 6 years old and the oldest was 90. The mean age of patients in this group was 28.96.

In 171 patients, an initial diagnosis of appendicitis was made in 116 (67.8%) patients, possible appendicitis in 18 (10.5%) and another diagnosis in 37 (21.7%). In the group who were initially diagnosed as having appendicitis, the final diagnosis was appendicitis for 111 (95.6%) patients and five (4.4%) had another diagnosis. In the group who diagnosed a possible appendicitis, the final diagnosis was appendicitis in seven (38.8%) and 11 (61.2%) had another diagnosis. The remaining 37 were subsequently found not to have appendicitis.

Using the Alvarado score, in the post-test group the sensitivity of the duty medical officer's diagnosis of ‘sure appendicitis’ was 94.1% (95% CI: 88.3%–97.1%), specificity 90.6% (95% CI: 79.7%–95.9%), positive predictive value 95.7% (95% CI: 90.3%–98.1%), negative predictive value 87.3% (95% CI: 76.0%–93.7%) and the accuracy of diagnosis was 93%.

None of the patients who were discharged as not having appendicitis came with complications such as appendicular abscess or peritonitis.

The presumptive diagnoses for the pre-test (210) and post-test (171) patients are shown in Table 2.

Table 2

Presumptive diagnosis for the pre-test and post-test patients

		Final diagnosis
	Initial diagnosis	Appendicitis	Other
Pre-test	Appendicitis
	85 (40.4%)	56 (65.8%)	29 (34.2%)
	Query appendicitis
	96 (45.8%)	50 (52.1%)	46 (47.9%)
	Other
	29 (13.8%)	7 (24.1%)	22 (75.9%)
Post-test	Appendicitis
	116 (67.8%)	111 (95.6%)	5 (4.4%)
	Query appendicitis
	18 (10.5%)	7 (38.8%)	11 (61.2%)
	Others
	32 (21.7%)	0 (0%)	37 (100%)

Summary of the analysis

The summary of the analysis is shown in Table 3. A test of proportion was used to compare the various diagnostic characteristics of pre- and post-test analysis. In the case of frequencies less than 5, the Fisher's exact test was used.

Table 3

Summary table for diagnosis of acute appendicitis

	Less rigorous: sure appendicitis + suspect appendicitis versus other diagnosis			More rigorous: sure appendicitis versus suspect appendicitis + other diagnosis
	Pre-test (%)	Post-test (%)	P value	Pre-test (%)	Post-test (%)	P value
Sensitivity	93.8	99.6	0.006	49.6	94.1	0.0001
Specificity	22.7	69.4	0.0001	70.1	90.6	0.004
Positive predictive value	58.6	87.8	0.0001	65.9	95.7	0.0001
Negative predictive value	75.9	98.7	0.002	54.4	87.3	0.0001
Accuracy of diagnosis (%)	60.9	90.6	0.0001	59.04	93	0.0001

A positive predictive value is the probability that a patient with a positive test result really does have the condition for which the test was conducted.

A negative predictive value is the probability that a patient with a negative test result really is free of the condition for which the test was conducted.

Discussion

Acute appendicitis is mainly a disease of adults – the mean age of sufferers is between 20 and 30.^3,4 The disease does not have any particular predilection for either sex.^3–5 In this study, the mean age of the patients in the pre-test and post-test groups were 30.78 and 28.96, respectively, with a male to female ratio of 55.6% to 44.4%.

Patients suffering from acute appendicitis require prompt diagnosis and management, because of its relatively high rate of complications^6–8 and further morbidity and mortality.^9,10 Therefore, quality decision-making in the management of acute appendicitis can be evaluated in terms of the postoperative normal appendix rate or diagnostic accuracy as judged by the operative findings, histopathology report and the discharge diagnosis.

Various diagnostic modalities have been used in order to diagnose acute appendicitis. These include ultrasonography; computed tomography and clinical evaluation; with varying rates of sensitivity, specificity, and diagnostic accuracy.^11–15 Before the advent of scoring systems in the diagnostic evaluation of acute appendicitis, the rate of postoperative normal appendix rates were quite high because the appendicectomies were performed with the view to minimizing the incidence of perforation and higher mortality.^16,17 With time, the belief that it would be possible to reduce morbidity and mortality was being questioned due to the feared complications of appendicitis, especially when the cost of the operation had been taken into consideration. Thus, various scoring systems, using various clinical parameters and laboratory values, came into existence and have undergone review processes from time to time.

A postoperative normal appendix was found in 7% in a pre-test analysis and 5% in a post-test analysis of the total of operated patients. The postoperative normal appendix in patients undergoing laparotomy for right iliac fossa pain ranged from 3% to 17.5% in other studies.^17,18 According to different authors, the acceptable negative appendicectomy rate, before the advent of scoring system, ranged from 10% to 30%.^16,17,19,20 The postoperative normal appendix rate in our study in the pre-test group was well below the acceptable value.

The duty medical officer diagnosed appendicitis in 65.8% cases without using any scoring system. In our study, the false positive rate was 34.2% and the false negative rate was 52.1%. There are various studies, both comparative as well as non-comparative, which have investigated the diagnostic accuracy of various scoring systems, but there are few studies of cases diagnosed by duty medical officers which did not use any scoring system as a control group. Considering the high false positive rate and false negative rate, potentially resulting in subsequent morbidity in the form of normal appendicectomy, the need for an appropriate scoring system using different clinical and easily available laboratory data is required.

In a previous study, it was found that no patient with a score below 5 had appendicitis¹⁷ but in our study four cases (11%) proved to have appendicitis even with an Alvarado score of 4 or 5. With a score of 3 or less it can confidently be said that there is no appendicitis. If the score is between 4 and 5, the patient needs observation and a review after 4–6 h and a re-evaluation of the score. We operated on patients with a score of 6 or more.

The Crnogorac et al. study found that the Alvarado score has a high sensitivity of 87%, average specificity of 60% and high diagnostic value of 82.7%.²¹ In the Chan et al. study, the positive and negative predictive values of Alvarado score were 77% and 97.6%, respectively.²² Denizbasi et al. found that the sensitivity and specificity of the Alvarado score were 95.4% (95% CI: 91%–99%) and 45.7% (95% CI: 37.4%–54%), respectively.⁵

The higher sensitivity and specificity found in our study is because ‘sure appendicitis’ and ‘suspected appendicitis’ have been distinguished. When ‘sure appendicitis’ and ‘suspected appendicitis’ were combined the sensitivity was 99.6% (95% CI: 96.1%–100.0%) and the specificity was 69.4% (95% CI: 56.2%–80.1%) which is closer to the results reported in other studies.

Comparing the pre-test and post-test groups, there was an improvement in diagnostic certainty from ‘suspected (?) appendicitis’ towards ‘sure appendicitis’ or ‘other diagnosis’. Sensitivity increased from 49.6% to 94.1% and specificity increased from 70.1% to 90.6%. Both the positive predictive value and the negative predictive value also improved significantly.

We concluded that the Alvarado score is a non-invasive, safe diagnostic procedure, which is simple, reliable and repeatable. It can be used in all conditions and is an inexpensive and uncomplicated supportive diagnostic method. The implementation of the use of the Alvarado score was shown to increase the certainty with which junior doctors made a diagnosis of acute appendicitis in the emergency department. This increase in performance will accrue benefits for patient care. It is recommended that the Alvarado score be used as a matter of policy in emergency departments. In cases where there is a paucity of diagnostic tools, clinical re-evaluations every 4–6 h are strongly recommended.

References

Alvarado

A practical score for the early diagnosis of acute appendicitis. Ann Emerg Med 1986;15:557–64

Mann

, Russell

RCG

, Williams

. Bailey & Love's Short Practice of Surgery. London: Hodder Arnold, 2006:829–40

Chan

MYP

, Tan

, Chiu

, Ng

. Alvarado score: an admission criterion in patients with right iliac fossa pain. J R Coll Surg Edinb Irel 2003;1:39–41

Cardall

, Glasser

, Guss David

. Clinical value of the total white blood cell count and temperature in the evaluation of patients with suspected appendicitis. Acad Emerg Med 2004;11:1021–7

Denizbasi

, Unluer

. The role of the emergency medicine resident using the Alvarado score in the diagnosis of acute appendicitis compared with the general surgery resident. Eur J Emerg Med 2003;10:296–301

Wilson

, Sinclair

, Mc Callion

, Potts

. Acute appendicitis in young children in the Belfast urban area: 1985–1992. Ulster Med J 1994;63:3–7

Hale

, Molloy

, Pearl

, Schutt

, Jaques

. Appendectomy: a contemporary appraisal. Ann Surg 1997;225:225–61

Korner

, Sondenaa

, Soreide

, Incidence of acute nonperforated and perforated appendicitis: age-specific and sex-specific analysis. World J Surg 1997;21:313–17

Hale

, Molloy

, Pearl

, Schutt

, Jaques

, J'Avis

. Appendectomy: improving care through quality improvement. Arch Surg 1997;132:153–7

10.

Styrud

, Eriksson

, Segelman

, Granstrom

. Diagnostic accuracy in 2,351 patients undergoing appendectomy for suspected acute appendicitis: a retrospective study 1986–1993. Dig Surg 1999;16:39–44

11.

Douglas

, Macpherson

, Davidson

, Gani

. Randomized controlled trial of ultrasonography in diagnosis of acute appendicitis, incorporating the Alvarado score. BMJ 2000;321:919–22

12.

McDonald

, Pendarvis

, Wilmoth

, Daley

. Influence of preoperative computed tomography on patients undergoing appendectomy. Am Surg 2001;67:1017–21

13.

Ford

, Passinault

, Morse

, Diagnostic ultrasound for suspected appendicitis: does an added cost produce a better outcome? Am Surg 1994;60:859–8

14.

Ramachandran

, Sivit

, Newman

, Schucartz

. Ultrasonography as an adjunct in the diagnosis acute appendicitis: a 4-years experience. J Pediatr Surg 1996;31:164–7

15.

Pena

, Taylor

, Tund

, Mandl

. Effect of computed tomography on patient management and costs on children with suspected appendicitis. Pediatrics 1999;104:440–6

16.

Kalan

, Talbot

, Cunliffe

, Rich

. Evaluation of the modified Alvarado score in the diagnosis of acute appendicitis: a prospective study. Ann R Coll Surg Engl 1994;76:418–19

17.

Colson

, Skinner

, Dunnington

. High negative appendectomy rates are no longer acceptable. Am J Surg 1997;174:723–6

18.

Fenyo

, Lindberg

, Blind

, Enochsson

, Oberg

. Diagnostic decision support in suspected acute appendicitis: validation of a simplified scoring system. Eur J Surg 1997;163:831–8

19.

Silberman

. Appendicectomy in a large metropolitan hospital. Retrospective study analysis of 1,013 cases. Am J Surg 1981;142:615–18

20.

Hoffmann

, Ramussen

. Aids in the diagnosis of acute appendicitis. Br J Surg 1989;76:774–90

21.

Crnogorac

, Lovrenski

. Validation of the Alvarado score in the diagnosis of acute appendicitis. Med Pregl 2001;54:557–61

22.

Chan

, Teo

, Ng

. The Alvarado score and acute appendicitis. Ann Acad Med Singapore 2001;30:510–12