Abstract
With a rising number of cranioplasty (CP) procedures and an increasing percentage of patients with a good clinical outcome and prolonged survival after CP, looking at the functional and aesthetic outcome of these patients becomes more and more important. The aim of our study was to evaluate a novel score, combining functional and cosmetics aspects after CP, created at our institution: the Rostock Functional and Cosmetic Cranioplasty (RFCC-) Score. A total of 27 patients were enrolled, representing all indications for a secondary CP after decompressive craniectomy or extended temporal trephination with a complete separation of the temporalis muscle. Besides the clinical evaluation, five different already established clinical rating systems were tested and compared with our score. For reasons of objectivity, the score was also tested against the patient's own rating. Our findings showed that the RFCC-Score, derived from a health professional, is superior to other scoring systems, which only display a facet of the functional state of the patient. Our score is objective and independent of a disposition for a depression of the patient. It can be obtained without the need of a verbal communication, making it applicable for nearly all patients after CP. The score is time-saving, clearly arranged, and reliable, which are inevitable requirements for the comparing and evaluation of different surgical techniques and associated complications of CP.
Introduction
C
One of the first cranial reconstructions using an autologous bone graft was described by Müller and König in 1890. 4,5 Synthetic materials were tested largely as bone substitutes for CP 3,6 to prevent complications associated with the use of autologous grafts (aseptic bone necrosis, infection, reduced strength of the graft) and in cases in which the initial bone graft could not be reused such as in cases of an infiltrating tumor. Besides focusing on the materials used in CP, with a sinking mortality rate, more attention was paid toward the cosmetic results after CP.
CP aims to restore the integrity of the skull not only for safety or cosmetic reasons, but also to improve the neurological condition of patients with the so-called syndrome of the threphined. 7,8 CP can restore the physiological cortical blood flow and cerebrospinal fluid (CSF) circulation. 9 During the last decades, the number of CP procedures has been rising continuously, mainly as a consequence of a rising number of decompressive craniectomies for patients with space-occupying ischemic strokes. 10 –15 Decompressive craniectomy may also be indicated in patients with malignant brain edema of various etiologies such as traumatic brain injury (TBI), intracerebral bleeding (ICB), and after aneurysmal subarachnoid hemorrhages (aSAH). 16 –21 Other reasons for the initial removal of the skull bone are osseous tumors or meningiomas with bone infiltration. Subsequently, the number of patients in all these circumstances with a good clinical outcome is rising, 11 with the need for CP as a second operative procedure.
Beside cosmetic reasons, the functional aspect after CP is of special interest. Besides scarring and wound healing issues, the main attention is drawn toward the temporal muscle and its role in CP for several reasons. During the initial operation, the muscle is often damaged while detaching it from the temporal bone. 22 The dissection of the muscle initially and again during the CP procedure often leads to scarring, muscle retraction, and atrophy, partly caused by impaired vascular supply and denervation. The muscle loss and functional impairment can be related to masticatory difficulties because of an impaired ability of mouth opening and jaw movement, and cosmetic impairment per temporal hollowing. 23 –25
One major drawback of nearly all recently published studies had been the fact that there is no existing clinical scoring system to rate the results after CP and make them comparable. 22,26 –28 In particular, there is no score to our knowledge that combines functional and aesthetic aspects after CP, both being equally of utmost significance. Therefore, the aim of our study was to design and validate a clinical score that is able to rate both functional and cosmetic results after CP and that makes clinical results comparable after different techniques of CP. In addition, besides being simple and accurate, obtaining the score should be independent of the possible interaction with the patient, making it eligible also for patients who lost their ability of proper communication because of aphasia after a cerebral infarction of the dominant hemisphere.
Methods
In this observational single-center study, we tested and validated our Rostock Functional and Cosmetic Cranioplasty (RFCC)-Score in terms of objectivity and reliability. Institutional Review Board approval was obtained before the start of this study (Reg.nr A2016-0018). Written consent was obtained from all patients.
Patient selection
Included were 27 adult patients (age >18 years) who had been treated at our department with a decompressive craniectomy and subsequent CP or after an extended temporal trephination with a complete separation of the temporalis muscle from the skull and subsequent/secondary CP. Follow-up visits were made at the earliest at six months after CP to test our score at a finite state of muscle atrophy, scarring, and the resulting functional limitations. All participants were able to communicate in a proper way and express their own experiences and valuation of the functional and cosmetic results.
Clinical examination
Mouth opening was measured as interincisal distance with a curved ruler, as well as the sideways jaw movement. Cutoff values for a physiological mouth opening were ≥38 mm, respectively, ≥9 mm for sideways jaw movement. 29,30
Additional scorings
To evaluate the objectivity and reliability of our score, we used in addition four different patient-based scoring systems and one grading system relying on the doctor's assessment: • House-Brackmann facial nerve grading system was used to assess the facial function and its potential impact on jaw movement.
31
This grading system was used independently of an existent peripheral or central paresis, because we focused on the lower facial expression and not the forehead movement. House-Brackmann grade was assessed by the same neurosurgeon (CH). • JFLS-8 (Jaw Functional Limitation Scale, 8-Item) is a patient-based scoring system to determine the orofacial function and its impact on quality of life.
32
Its results were grouped into four scales (scale 0: 0 points; scale 1: 1–3 points; scale 2: 4–7 points; scale 3: 8–10 points).
31
• CES-D (Center for Epidemiological Studies-Depression) is a 20-item patient-based screening score that helps identify individuals at risk for clinical depression.
33
Cutoff values for patients at risk for depression were ≥16 points (score 1), respectively, ≥22 points detecting a severe depression (score 2). • EQ-5D-5L is a health-related five-level and patient-based quality of life measurement.
34
The obtained results were age-adjusted and compared with a validation set resulting in the EQ-5D-5L index value.
35
• VAS (Visual Analog Scale) is a continuous response scale ranging from 0 to 100 describing general health.
34
RFCC-Score
Our score consists of four equally rated items covering the functional and cosmetic results after CP. Each item consists of three answers, judged with ascending points given (one, two, or three). All answers were added up, so possible scores ranged from four (best)–12 (worst) points.
• Scar/Skin
• Bland, good adaption, not thinned 1 point
• Irritated, reddened, no dehiscence, desquamation 2 points
• Dehiscence, thinned skin, translucency of sutures 3 points
• Fitting of cranioplasty
• Good fitting, no fissuring 1 point
• Little fissuring (more palpable rather than visible) 2 points
• Clearly visible fissuring/gapping, resorption 3 points
• Symmetry
• Symmetrical appearance 1 point
• Little asymmetry 2 points
• Clearly visible asymmetry 3 points
• Function
• No limitations 1 point
• Little restrictions in jaw opening/movement 2 points
• Restricted jaw opening/movement 3 points
The score results were classified as good post-operative results (4–6 points), acceptable post-operative results (7–9 points), and poor post-operative results (10–12 points). A higher value at our RFCC equates to a worse post-operative result. The thresholds between the three ratings of the result after CP (good, acceptable, and poor) are equally distributed because the answers for each item are also equally given with a uniform weighting. The score was applied by the patient and the neurosurgeon on the date of the follow-up visit.
Statistical analysis
All data were stored and analyzed using SPSS 22.0 software (SPSS, Chicago, IL). Descriptive statistics were computed for continuous and categorical variables. The statistics computed included mean, median, standard deviation (SD), minimum and maximum of continuous variables (outlined as mean ± SD), frequencies, and percentages of categorical factors. Test selection was based on evaluating the variables for normal distribution employing the Kolmogorov-Smirnov test. Testing for differences of continuous variables between study groups was accomplished by two-sample t test or the Mann-Whitney U test, as appropriate.
All p values resulted from two-sided statistical tests, and values of p < 0.05 were considered to be statistically significant. The relationship between two variables was evaluated by the Spearman rank correlation coefficient.
Direct comparison of RFCC-Scores obtained from the patient and the surgeon was performed graphically. The differences of the RFCC-Scores (patient – surgeon) per patient were displayed on the y axis. Limits of agreement (95%) were calculated with the formula “mean value of difference ±1.96 × SD of difference.” For better analysis of the agreement of both RFCC-Scores, we tested the difference by using Cohen ƙ coefficient. 36
Results
A total of 27 patients met the inclusion criteria, with a median age of 52 ± 11.51 years and a slightly female dominance (1.25:1) (Table 1). The prevalent reason for the initial operation was a cerebral infarction with consecutive space-occupying edema (44.4%), followed by aSAH (18.5%) and TBI (18.5%). Two patients were included after an initial tumor removal (meningioma and an osteolytic bone cyst) with an extended temporal trepanation (7.4%); three patients had ICB and the need of a decompressive craniectomy (11.1%). A total of 38 CP operations were performed on the 27 patients included with a mean number of 1.52 implants per patient, meaning that 37% of the enrolled patients had a complication with the need of a secondary CP.
aSAH, aneurysmal subarachnoid hemorrhage; TBI, traumatic brain injury; ICB, intracerebral hematoma; CP, cranioplasty; CAD/CAM, Computer-Aided Design/Computer-Aided Manufacturing.
The main reason for a re-operation was superficial or deep infection of the surgical site (64.3%). An aseptic bone necrosis was recorded in two cases (14.3%), which led to discarding the bone and implanting a Computer Aided Design/Computer-Aided Manufacturing (CAD/CAM) CP, being in accordance with other studies. 37 In concordance with our institutional policy, primary implant for CP should always be the autologous bone (68.4%). Mean follow-up period after the last CP was 56.3 ± 46.20 months (range 9–142 months).
The clinical examination of the patients showed a slight impairment of the facial function evaluated by the House–Brackmann facial nerve grading system (Table 2), with a median grading of 2 and five patients with a moderate dysfunction (grade 3). Higher grades of dysfunction were not observed in our cohort. The mouth opening was recorded in 18.5% of all cases as impaired; median interincisal distance was 47 mm with a cut-off value of 38 mm as a sign of impaired temporalis muscle function. In contrast to these findings, sideways jaw movement was found in 29.6% cases within the physiological range of ≥9 mm.
JSLF-8, Jaw Functional Limitation Scale, 8-Item; CES-D, Center for Epidemiological Studies-Depression; EQ-5D-5L, EuroQol five-dimensional five level; VAS, visual analogue scale; RFCC-Score, Rostock Functional and Cosmetic Cranioplasty-Score.
The patient-based JFLS-8 showed that 62.9% of all patients enrolled had no (score 0) or a little impairment of their orofacial function (score 1); on the other hand, 25.9% had severe limitations (score 3). For a detailed analysis of our patient cohort, all participants completed a screening for signs of depression, maybe affecting the other patient-based scores (CES-D); 51.9% showed no signs of a depression, 25.9% were at risk of depression, and 22.2% showed signs of a severe depression. EQ-5L-5D index values showed a mean of 0.81 ± 0.10, and the VAS a mean of 60 ± 29.3. Compared with the general German population value sets, the VAS of our cohort is clearly beyond the “reference value” of a mean VAS of 85.1. 38
Looking at the RFCC-Score results obtained by the neurosurgeon, in 15 patients the post-operative result was rated good (55.6%); 12 patients showed an acceptable result (44.4%). No poor post-operative result was rated in our cohort by the neurosurgeon. In contrast, only six patients rated their post-operative result as acceptable (22.2%), and 21 patients as a good result (77.8%); also, no poor result was rated by the patients.
For validation of the RFCC-Score, we analyzed the different clinical measurements and scores and their impact on each other (Table 3). Interestingly, the number of CP operations performed for each patient influenced significantly the individual mouth opening (p = 0.001), having no impact on the other scores. The House-Brackmann grading was correlated positively with the RFCC-Score obtained from the surgeon, but not from the patient (p = 0.004 [correlation coefficient 0.54] and p = 0.458 [correlation coefficient 0.15], respectively). In addition, the facial nerve function significantly influenced both quality of life scores EQ-5D-5L and VAS (p = 0.017 and p = 0.035, respectively). The mouth opening had no influence on these two scores (p = 0.825 and p = 0.779, respectively); the jaw sideways movement significantly influenced only the VAS (p = 0.019) with no impact on the EQ-5D-5L (p = 0.106).
RFCC, Rostock Functional and Cosmetic Cranioplasty Score; JFLS-8, Jaw Functional Limitation Scale, 8-Item; CES-D, Center for Epidemiological Studies-Depression; EQ-5D-5L, EuroQol five-dimensional five level; VAS, visual analogue scale.
Statistically significant values are presented in bold (significant ≤0.05).
The JFLS-8 was not correlated with the RFCC-Score obtained by the surgeon or the patient (p = 0.625 and p = 0.286, respectively—data not shown), Neither did the measured mouth opening nor the jaw movement (p = 0.739 and p = 0.188, respectively) influence our scoring system.
The RFCC-Scores, obtained from the patient and the neurosurgeon, showed a high correlation (p = 0.018) (Table 4). Looking at the agreement of the two scores, Cohen ƙ analysis showed a poor agreement with a coefficient of 0.239 (quadratic weighted 0.373). These observations were supported by the Bland-Altmann plot (Fig. 1), which showed identical scorings in 37% of all obtained RFCC-Scorings. There were 11 patients (40.7%) who rated the post-operative results better than the surgeon (negative difference); six patients (22.2%) scored the results worse (positive difference).

Plot of the differences between the Rostock Functional and Cosmetic Cranioplasty (RFCC)-Scores obtained from the patient and the neurosurgeon per patient. No observed differences were displayed as zero on the y axis; 95% limits of agreement were displayed as reference lines (2.76 and −3.80, respectively).
RFCC, Rostock Functional and Cosmetic Cranioplasty-Score.
Statistically significant values are presented in bold (significant ≤0.05).
Discussion
The number of decompressive craniectomies is rising constantly and thereby the number of CP procedures. In addition, the temporal and pterional approach are the most commonly used in the field of neurosurgery. The common feature of all these procedures is the dissection or mobilization of the temporal muscle, which later on needs to be refixated. Associated problems are temporomandibular disorders, temporal hollowing with aesthetic impairment, and post-operative pain. 23 –25 These issues are yet underrepresented in the existing literature regarding clinical outcome reports, beside their rising importance for the patient and quality of life after CP.
In the present study, we designed and validated a novel clinical score, reflecting the functional and aesthetic results after CP—the RFCC-Score. The elected cohort consisted of typical patients who underwent a decompressive craniectomy, followed by CP (81.5%). Extended temporal trephination and CP represented 18.5% of cases (patients after standard temporal or pterional approaches were not included to test our score) within a patient cohort, where the temporal muscle had been totally mobilized from the temporal bone, carrying the biggest risk of a functional impairment and temporal hollowing developing. Mean follow-up period was 56.3 month, ensuring a long-term cosmetic and functional result. Figure 2 is showing a typical patient enrolled in our study at a follow-up visit 13 months after CP.

Computed tomography (CT) scans and clinical picture of a typical patient enrolled in our study. (
The jaw-specific function measurements showed an impaired sideways movement in 30% of cases, but 63% of all enrolled patients complained of no or just little impairment of their orofacial function (JFLS-8). The mouth opening, on the other hand, was impaired in 19% of our cohort. Correlation analysis showed a stronger interconnection of the jaw sideways movement toward the JFLS-8 than toward the mouth opening (p = 0.188 and p = 0.739, respectively), besides both not being statistically significant. These findings were supported by the results from our RFCC-Score, being more correlated with the jaw sideways movement than with the mouth opening (RFCC surgeon p = 0.283 and p = 0.564, respectively). One possible explanation is the selected cutoff value for a physiological mouth opening of 38 mm, diverging results from other studies and a great interpersonal variety. 30 Interestingly, the mouth opening was strongly influencing the disposition for depression (p = 0.058).
The House-Brackmann score was significantly correlated with the RFCC-Score obtained from the surgeon (p = 0.004), but not the one obtained by the patient (p = 0.458). The facial nerve function, on the other hand, strongly correlated with the quality of life of the patients, assessed by the EQ-5D-5L and VAS, but did not influence the other measurements or scores. Therefore, the surgeon RFCC-Score reflects more the orofacial aspect, which is related to the quality of life of the patients after CP, rather than the single muscle function. This thesis is supported by a missing correlation between the RFCC-Scores obtained from both the surgeon and the patient and the JFLS-8. An individual's perception of pain after CP, for example, is often underrepresented and underestimated in post-operative assessments. 39 Rocha-Filho and colleagues 25 reported in 2007 that 47.9% of patients after pterional craniotomy for aneurysm complained of headache during dental evaluation, 28.2% reported pain during normal jaw movement. 29,30 These functional aspects will always influence the individual experience and rating of his or her own clinical state. Therefore, it is important to involve these aspects into a scoring system without losing the advantage of an objective observer.
The difference between the RFCC-Scores obtained by the patient and the surgeon was independent of a disposition toward depression and the quality of life assessed by the patient, revealing that our score is objective and independent of the perception of patients potentially influenced by a depression. Individuals with a higher RFCC-Score obtained by themselves and a corresponding positive difference toward the surgeons scoring had 19.8 points at the CES-D, compared with 14.3 points mean of the patients without a difference or a negative difference towards the surgeon scorings. These results are showing clearly that patients with a disposition for depression tend to rate their functional and aesthetic results after CP inferior compared with the more objective results gained from the surgeon.
There was little agreement between the two RFCC-Scores, revealed by the plot and the Cohen ƙ coefficient. These findings are in concordance with the correlation analysis and what we expected. The diverging results and the influence of the individual's emotional condition and percipience on the patient scorings makes our RFCC-Score superior to patient-based scoring systems. One additional and very important benefit is the independence of our score from the patient's ability to speak. Just a minimum of interaction is required, such as the ability to open the mouth and to move the jaw.
Today, there is still a lack of good and satisfactory clinical follow-up data after CP procedures, not only concerning the general outcome, but also looking at complications. Putting us into a position in which we can compare different surgical techniques and used materials for CP is an importance aim of two registers started in the United Kingdom and Germany. 40,41 From our point of view, it is inevitable, especially within the setting of a prospective registry, to gain valid but compressed data that can reflect the functional state and quality of life of the patient.
In contrast to other assessments, our score is able to represent these individual aspects in a bigger dimension than others, being more influenced by the functional and cosmetic aspects in total, rather than the sheer muscle function. One drawback of our study is the limited number of patients being enrolled. A larger study is therefore already being conducted, testing our RFCC-Score in general and especially for interobserver variability. Another restriction of our scoring system has to be considered: the obtained score result is dynamic over time. During the follow-up period, the result will change because of an ongoing muscle atrophy or an aseptic bone necrosis with an increased gapping between the bone fragment and the skull. Therefore, the score result as a tool for the assessment of the post-operative functional and aesthetic outcome after CP should always be rated considering its dependence on the moment of examination.
Footnotes
Acknowledgments
The authors thank Prof. Günther Kundt for his valuable statistical suggestion and the patients who participated in this study.
Author Disclosure Statement
No competing financial interests exist.
