Abstract
Study Design
Original Research.
Objectives
This study aimed to evaluate the preoperative cross-sectional area (CSA) and muscle quality of the major lumbar muscles in patients with adolescent idiopathic scoliosis (AIS) and to investigate their associations with perioperative outcomes. The study also explored lumbar muscle morphology as a potential imaging-based surrogate marker for perioperative risk stratification.
Materials and Methods
In this retrospective study, 81 consecutive AIS patients who underwent posterior spinal fusion at a single institution between January 2019 and December 2023 were included. Two trained pediatric radiologists independently reviewed preoperative T1-weighted axial MRI scans at the L3 level to assess muscle CSA, Modified Goutallier grade, and threshold-based parameters of fatty infiltration. Interobserver reliability was evaluated using intraclass correlation coefficients. Multivariable linear regression analyses were performed to assess associations between muscle parameters and perioperative outcomes, adjusting for relevant covariates. Receiver operating characteristic (ROC) analysis was used to identify curve-specific muscle thresholds for perioperative risk stratification (P < .05).
Results
Patients were stratified according to Lenke type: 1-2 (Group 1, n = 30), 5 (Group 2, n = 14), and 3-4-6 (Group 3, n = 37). Fatty infiltration of the multifidus and erector spinae muscles was significantly higher in Groups 2 and 3 compared with Group 1 (P < .05). Associations between muscle morphology and perioperative outcomes were curve-pattern dependent, with no significant correlations in Group 1. In Group 2, larger functional CSA of the multifidus, paraspinal muscles, and quadratus lumborum was independently associated with increased blood loss (β = 98.06, P = .001; β = 17.49, P = .017; β = 29.3, P = .014). Greater psoas fatty infiltration predicted delayed ambulation (β = 0.236, P = .005). In Group 3, increased paraspinal fatty degeneration predicted longer operative duration (β = 8.42, P = .015). ROC analysis identified curve-specific MRI-derived thresholds for perioperative risk stratification.
Conclusion
Preoperative lumbar muscle quality, rather than muscle size, was associated with perioperative complexity and recovery in AIS surgery in a curve-pattern–dependent manner. MRI-based muscle assessment may represent a potential tool for perioperative risk stratification, although prospective validation is required.
Keywords
Introduction
Surgical correction of adolescent idiopathic scoliosis (AIS) represents one of the most demanding procedures in spine surgery, characterized by long operative times, substantial blood loss, and potential perioperative complications.1,2,3,4 In this setting, accurate preoperative assessment is crucial for optimizing outcomes and minimizing morbidity.5,6 Traditionally, predictive factors have included demographic, radiographic, and clinical variables.6,7,8 Recently, however, increasing attention has focused on muscle morphology and quality, particularly the cross-sectional area (CSA) and fatty degeneration of paraspinal and hip flexor muscles, as potential markers of physical reserve and postoperative recovery capacity.9,10,11,12,13
In adult spine surgery, sarcopenia and fatty muscle degeneration have been increasingly recognized as indicators of poor physiological status and adverse surgical outcomes.14,15,16,17,18,19 Several studies have reported that reduced psoas or paraspinal CSA and increased fatty infiltration correlate with longer hospital stay, higher complication rates, and delayed functional recovery after spinal procedures.14,15,17,18,19 These findings underscore the increasing clinical significance of muscle health as a component of preoperative risk stratification.
However, to the best of our knowledge, the prognostic role of muscle morphology has not yet been systematically investigated in pediatric populations, particularly in patients with AIS. The adolescent musculoskeletal system differs markedly from that of adults in its adaptive and regenerative potential, making direct extrapolation of adult data unreliable. 20 In AIS, asymmetric loading and chronic postural imbalance may lead to selective atrophy or fatty infiltration of paraspinal muscles, especially on the concave side of the curve.20,21,22 Whether these preoperative muscular alterations influence perioperative parameters, such as blood loss, operative time, or early recovery, remains largely unexplored.
This study aims to investigate the relationship between preoperative lumbar muscle morphology, including CSA and fatty infiltration, and early postoperative outcomes in patients with AIS undergoing posterior spinal fusion (PSF). Additionally, it was evaluated whether these muscular characteristics differ across Lenke curve types and whether they have specific implications for surgical and short-term recovery metrics. By clarifying these associations, this study aims to determine whether preoperative lumbar muscle quality assessment may serve as a predictive marker of perioperative performance and function as an imaging-based surrogate marker for perioperative risk stratification in pediatric spinal deformity surgery.
Materials and Methods
Patient Population
The study cohort consisted of consecutive pediatric patients with AIS who underwent elective PSF between January 2019 and December 2023 at a single center. Two experienced senior surgeons performed all procedures. Prior to surgery, each patient underwent thorough pulmonary and cardiovascular assessments and received medical clearance. All patients had a minimum follow-up duration of 2 years.
Study Design
This retrospective observational cohort study was conducted in accordance with the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guidelines. 23 As part of routine preoperative assessment, all patients scheduled for surgical correction underwent magnetic resonance imaging (MRI) to screen for potential intraspinal abnormalities. Preoperative T1-weighted axial lumbar spine MRI scans acquired preoperatively were retrospectively reviewed. Muscle morphometry was evaluated by analyzing both quantitative (muscle size) and qualitative (muscle composition) parameters bilaterally for several muscle groups: the posterior paraspinal muscles (erector spinae and multifidus), the anterior paravertebral muscle (psoas), and the lateral paravertebral muscle (quadratus lumborum). T1-weighted sequences were preferred over T2-weighted imaging due to their superior ability to differentiate muscle tissue from fatty infiltration, thereby enhancing the accuracy of muscle morphology assessment.24,25 Measurements were performed at the level of the inferior endplate of L3. The L3 vertebral level was selected as the standardized reference point, as it has been consistently demonstrated to provide a reliable and anatomically reproducible landmark for cross-sectional assessment of lumbar musculature in imaging-based morphometric analyses.26,27 Both radiologists were blinded to all clinical and perioperative outcome data at the time of image analysis.
Inclusion criteria were: (1) confirmed diagnosis of AIS; (2) age between 10 and 21 years; (3) availability of pre-operative T1-weighted MRI including the L3 vertebral level; and (4) minimum clinical follow-up of 24 months. Exclusion criteria included non-idiopathic etiologies, a history of previous spinal surgery, missing data, MRI studies affected by artifacts that precluded accurate muscle segmentation, imaging examinations that did not extend to the L3 vertebral level, MRI examinations performed at external institutions, as well as patients who experienced postoperative complications associated with a prolonged postoperative course, who were excluded to minimize potential confounding effects on early outcome measures.
Patients were categorized according to their curve pattern using the Lenke classification.1,2,3 Based on this, they were grouped into three categories: thoracic curves only (Group 1, Lenke 1 and 2), lumbar curves only (Group 2, Lenke 5), and double/triple curves (Group 3, Lenke 3, 4, and 6).
The primary objective of this study was to investigate the association between preoperative lumbar muscle morphology and early postoperative surgical outcomes in patients with AIS undergoing PSF, as well as to determine whether such associations vary across specific curve subtypes. A further objective was to identify clinically meaningful, MRI-based muscle cut-off values associated with increased perioperative risk, thereby exploring the potential role of preoperative lumbar muscle quality assessment as a predictive marker for perioperative performance in pediatric spinal deformity surgery.
Magnetic Resonance Imaging (MRI) Scan Protocol
Lumbar spine MRI examinations were obtained according to the routine institutional imaging protocol using 1.5-T Magnetom Sola systems (Siemens Healthineers, Germany), without any protocol customization for the purposes of this study. The standard imaging protocol comprised sagittal T1-and T2-weighted sequences covering the lumbosacral spine from T12 to S2, along with axial acquisitions extending from L1 through S1. Representative sequence parameters were as follows: sagittal T2-weighted imaging (TR/TE ≈ 3500/90 ms, slice thickness 4 mm); sagittal T1-weighted imaging (TR/TE ≈ 450-699/10-20 ms, slice thickness 4 mm); axial T2-weighted imaging (TR/TE ≈ 4000/90 ms, slice thickness 4 mm); and sagittal STIR sequences (TR/TE ≈ 3500/90 ms). Total acquisition time ranged from approximately 10 to 16 minutes.
Postoperative Protocol
All patients underwent an opioid-sparing analgesic regimen. Early mobilization was encouraged, with patients assisted to sit after the first 24 hours and to stand and ambulate with assistance after the first 48 hours following surgery. The subfascial drain and urinary catheter were removed 48 hours postoperatively. Antibiotic prophylaxis with intravenous cefazolin was administered intraoperatively and continued for 48 hours postoperatively. A standing anteroposterior and lateral spinal radiograph was performed 48 hours after ambulation to verify correction and implant position.
Study Variables and Outcome Measures
Study Variables
Preoperative data collected included age, sex, American Society of Anesthesiologists (ASA) Score, Lenke type, Body Mass Index (BMI) and Risser Stage.
Muscle parameters and measurement methods
CSA: Cross-Sectional Area; PMI: Psoas Muscle Index; PPMI: Paraspinal Muscle Index; PVR: Psoas-Vertebral Ratio; PPVR: Paraspinal-Vertebral Ratio; PPMR: Psoas-Paraspinal Muscle Ratio; GC: Goutallier Classification; %Fat: Intramuscular Fat Fraction; FA: Fat Area; %FCSA: Functional Cross-Sectional Area Percentage; FCSA: Functional Cross-Sectional Area; MRI: Magnetic Resonance Imaging.

Example of T2-weighted axial images showing the grading of fatty infiltration of paraspinal muscles according to the Modified Goutallier Classification (MGC). Grade 0: No fatty infiltration, Grade 1: Few fatty streaks within the muscles, Grade 2: Less than 50% fat within the muscle, Grade 3: 50% of fat within the muscles, Grade 4: More than 50% fat within the muscle. Reproduced from Mandelli et al. 28 under the Creative Commons Attribution License (CC BY), Copyright © 2021 Mandelli, Nüesch, Zhang, Halbeisen, Schären, Mündermann and Netzer
Radiographic assessments included preoperative measurements of main and minor curve magnitudes (Cobb angles) and curve flexibility (%), as well as postoperative measurements of main and minor curves with calculation of corresponding correction rates (CRs). Instrumentation data collected comprised the number of fused levels, total pedicle screws and number of pedicle screws per level. Surgical outcomes included estimated blood loss (EBL, ml), length of hospital stay (LOS, days), surgery duration (min), and time to assisted ambulation(days). EBL was estimated by the anesthesia team based on suction volume and sponge weight.
Figure 2 shows ImageJ-based ROI segmentation and threshold-based %Fat computation on axial T1-weighted MRI at the inferior endplate of L3, along with the main clinical outcomes of interest. Example of lumbar muscle segmentation, quantitative MRI analysis, and associated clinical outcomes. Upper panel (from left to right): (A) sagittal T1-weighted MRI centered on the L3 vertebral body, indicating the level of the axial slice; (B) native axial T1-weighted image acquired at the inferior endplate of L3; (C) manual region-of-interest (ROI) segmentation of lumbar muscles performed using ImageJ, including bilateral psoas (PS), erector spinae (ES), multifidus (MF), and quadratus lumborum (QL); (D) threshold-based segmentation map highlighting fatty infiltration (FI), used to calculate fat percentage (%Fat) and derive quantitative parameters such as fat area (FA) and functional cross-sectional area (FCSA). Lower panel: schematic representation of the main perioperative outcomes analyzed in the study, including estimated blood loss (EBL, ml), length of hospital stay (LOS, days), surgery duration (min), and time to assisted ambulation (days)
Outcome Measures
Curve flexibility of the major and minor curves, which was considered the main predictor of curve stiffness, was calculated as follows: • ((Preop static major/minor curve-Preop major/minor curve on bending films)/Preop static major/minor curve)*100.
The postoperative CR of the major and minor curves were calculated as follows: • ((Preop static major/minor curve-Post-halo/Postop/Last FUP major/minor curve)/Preop static major/minor curve)*100.
Interobserver Reliability
All muscle measurements were performed by two trained pediatric radiologists. Interobserver reliability for continuous variables (CSA and %Fat) was assessed using the intraclass correlation coefficient (ICC [2,1], two-way random-effects model with absolute agreement), with values of 0.75-0.90 indicating good agreement and ≥0.90 indicating excellent agreement. 30 Agreement for MGC was evaluated using quadratic weighted Cohen’s kappa. Results are reported with 95% confidence intervals (CIs), and only paired measurements with complete data from both observers were analyzed.
Statistical Analysis and Ethics
Data analysis was performed using RStudio version 4.4.3. Continuous variables are reported as mean ± standard deviation (SD) and range, and normality was assessed using the Shapiro–Wilk test. Between-group comparisons were conducted using one-way ANOVA for normally distributed variables and the Kruskal–Wallis test for non-normally distributed variables. Categorical variables were compared using the chi-square test or Fisher’s exact test, as appropriate. Associations between preoperative paraspinal muscle characteristics and perioperative outcomes were examined using a two-step approach. Univariate relationships were initially assessed with Spearman’s correlation analysis, after which multivariable linear regression models were constructed to identify independent predictors within each Lenke group. Models were adjusted for age, BMI, and surgery duration (included as a covariate to isolate muscle effects from surgical complexity). Multicollinearity was checked using the Variance Inflation Factor (VIF), ensuring no redundant variables were included (VIF < 5). An exploratory clinical threshold analysis was performed to identify actionable “risk profiles” using receiver operating characteristic (ROC) curve analysis. “High-risk” events were defined as outcome values above the 75th percentile of the respective group distribution. For each muscle-related predictor, the optimal cut-off value was determined using the Youden index to maximize the combined sensitivity and specificity. Statistical significance was set at P < .05.
The study was conducted in accordance with international ethical guidelines for clinical research, as outlined in the Helsinki and Istanbul Declarations. As this study involved retrospective analysis of existing medical records with no patient intervention or alteration of clinical management, formal IRB approval was not required, and waived in accordance with institutional policy.
Results
Patient and Group Selection
Between January 2019 and December 2023, 447 pediatric patients underwent surgical correction. Of these, 201 were excluded due to neuromuscular, congenital, or syndromic etiology, two-stage procedures, missing data, or age outside the inclusion range, leaving 246 AIS patients who underwent single-stage PSF. In-site preoperative MRI was available 83 patients. Two additional patients were excluded due to postoperative complications (one superficial infection and one dural tear), both of which considerably prolonged the LOS. Therefore, the final study cohort comprised 81 AIS patients. These were further stratified into Group 1 (Lenke 1-2, n = 30), Group 2 (Lenke 5, n = 14), and Group 3 (Lenke 3-4-6, n = 37). The patient selection process is summarized in Figure 3. Patients’ selection
Demographic and Anthropometric Characteristics
Demographic Characteristics
ASA: American Society of Anesthesiologists Physical Status Classification; F: Female; M: Male.
Anthropometric Characteristics
BMI: Body Mass Index. Data are presented as mean ± standard deviation (range). P-values were computed using the Kruskal–Wallis test.
Interobserver Reliability
Interobserver Reliability for Muscle Measurements (Intraclass Correlation Coefficient/Quadratic Weighted Cohen’s Kappa)
ICC: intraclass correlation coefficient (two-way random-effects model, absolute agreement, single measurements); κw: quadratic weighted Cohen’s kappa. CSA: cross-sectional area; GC: Goutallier classification.
Muscle Morphology Characteristics
Preoperative Muscle Characteristics
CSA: Cross-sectional area; GC: Goutallier classification; PMI: Psoas muscle index; FCSA: fat cross-sectional area; PVR: psoas-to-vertebral body ratio. Data are presented as mean ± standard deviation (range). P-values were computed using the one-way ANOVA or Kruskal–Wallis tests.
Radiographic Outcomes
Radiographic Characteristics
CR: Correction rate. Data are presented as mean ± standard deviation (range). P-values were computed using the one-way ANOVA or Kruskal–Wallis tests.
Operative and Inpatient Data
Surgical and Hospitalization Data
LOS: Length of stay; EBL: Estimated Blood Loss. Data are presented as mean ± standard deviation (range). P-values were computed using the one-way ANOVA or Kruskal–Wallis tests.
Muscle Morphology and Perioperative Outcomes Correlation Analysis
Detailed results of the univariate Spearman correlation analyses are reported in Supplemental Table 1, whereas multivariable linear regression analyses are reported in Supplemental Table 2. In Supplementary Table 2, regression coefficients are presented with standard errors, 95% confidence intervals, exact p-values, and model sample sizes. Significant multivariable associations are summarized in Figure 4. Significant multivariable linear regression predictors of perioperative outcomes stratified by Lenke subgroup. Forest plots show statistically significant associations between lumbar muscle morphology parameters and estimated blood loss, surgery duration, time to assisted ambulation, and length of hospital stay. Points represent β coefficients and horizontal bars represent 95% confidence intervals. Red markers indicate Lenke 5 curves and blue markers indicate Lenke 3-4-6 curves. Only predictors with P < 0.05 are shown for clarity; complete multivariable model results, including beta coefficients, standard errors, 95% confidence intervals, exact P-values, and model sample sizes, are reported in Supplementary Table 2
Univariate Correlation Analysis
Univariate analysis showed limited, group-specific associations between muscle parameters and perioperative outcomes. In Group 2, muscle quality metrics were associated with surgery duration, blood loss, and time to ambulation (P < .05), while in Group 3 significant correlations were mainly restricted to psoas fat–related parameters. No associations regarding Group 1 were found.
Multivariable Correlation Analysis of Independent Predictors
Group 1: Thoracic Curves (Lenke 1-2)
In patients with non-structural lumbar curves (Lenke 1-2), neither muscle size nor muscle quality parameters were associated with perioperative outcomes, including EBL, LOS, or operative time (all P > .05).
Group 2: Thoracolumbar/Lumbar Curves (Lenke 5)
In patients with single structural lumbar curves (Lenke 5), posterior muscle mass and anterior muscle quality independently predicted perioperative outcomes. Higher multifidus, paraspinal and quadratus lumborum FCSA areas (β = 98.06 (P = .001), β = 17.49 (P = .017) and β = 29.3 (P = .014), respectively) and PPMI (β = 48.70, P = .009) were associated with increased EBL, whereas a higher PPMR predicted reduced EBL (β = −567.98, P = .001). Increased psoas %Fat was associated with delayed ambulation (β = 0.236, P = .005), whereas higher psoas %FCSA was associated with shorter time to ambulation (β = −0.236, P = .005).
Group 3: Complex Curves (Lenke 3-4-6)
In patients with double or triple curves (Lenke 3-4-6), associations between posterior muscle degeneration, somatic constitution, and surgical burden were more heterogeneous. Greater fatty infiltration of the posterior musculature was independently associated with prolonged operative duration, with both paraspinal FA (β = 8.42, P = .015) and multifidus FA (β = 23.80, P = .012) emerging as significant predictors. Additionally, the PMI was positively associated with intraoperative blood loss (β = 89.08, P = .048).
Clinical Threshold Analysis
Receiver Operating Characteristic (ROC)-Derived Cut-Off Values for Perioperative Risk Stratification and Clinical Interpretation
ROC: receiver operating characteristic; AUC: area under the curve; FCSA: functional cross-sectional area; CSA: cross-sectional area; FA: fat area; PMI: psoas muscle index. Sensitivity and specificity reported at the optimal Youden index cut-off. High-risk outcomes defined as values above the 75th percentile of the respective group distribution.

Receiver operating characteristic (ROC) curves for muscle-based predictors of high bleeding and prolonged surgery across Lenke subtypes
Discussion
This study evaluated the relationship between preoperative lumbar muscle morphology and early postoperative outcomes in AIS patients undergoing PSF, finding that muscle quality, rather than muscle size, was meaningfully associated with early postoperative outcomes. Muscle morphology–outcome relationships were strongly curve-pattern dependent. No associations between lumbar muscle metrics and perioperative outcomes were identified in Group 1 (Lenke 1-2), supporting the anatomical specificity of the analysis when surgical exposure is confined to the thoracic spine. In contrast, patients in Group 2 (Lenke 5) exhibited distinct, independent predictors of perioperative burden. Greater posterior muscle mass, reflected by increased multifidus, paraspinal, and quadratus lumborum FCSA as well as higher posterior PPMI, was independently associated with increased intraoperative blood loss, whereas a higher PPMR exerted a protective effect. Additionally, anterior muscle quality influenced early recovery, as increased psoas fatty infiltration predicted delayed ambulation, while higher psoas %FCSA was associated with earlier mobilization. In complex curve patterns (Lenke 3-4-6), the relationship between muscle morphology and surgical burden was more heterogeneous, likely reflecting the influence of multiple coexisting anatomical and procedural factors inherent to complex curve patterns. Posterior muscle degeneration emerged as the dominant determinant, with greater fatty infiltration of the paraspinal and multifidus muscles independently predicting prolonged operative duration, likely reflecting the increased curve severity and surgical complexity characteristic of these deformity patterns. Furthermore, somatic constitution, as reflected by a higher PMI, was associated with increased intraoperative EBL, underscoring the multifactorial nature of surgical complexity in double- and triple-curve patterns. ROC analysis identified clinically actionable preoperative thresholds for risk stratification. In Lenke 5 curves, posterior muscle hypertrophy reliably discriminated patients at risk for high blood loss (>400 mL), whereas in Lenke 3-4-6 curves, paraspinal fatty degeneration and elevated muscle indices identified patients at increased risk for prolonged operative time (>284 minutes) and excessive bleeding (>800 mL). Overall, these findings suggest that muscle quality, more than muscle size, is a curve-dependent determinant of perioperative complexity and early recovery in AIS surgery. This conclusion requires qualification: in Lenke 5 curves specifically, posterior muscle size (as reflected by FCSA) was itself a significant predictor of intraoperative blood loss, underscoring that the relative importance of muscle size versus quality varies by curve pattern. Across the full cohort, however, qualitative parameters, particularly fatty infiltration, demonstrated broader and more consistent associations with perioperative outcomes.
In adult spinal surgery, sarcopenia and muscle fatty degeneration are well-established predictors of adverse outcomes.10,14,15 Studies by Bokshan et al demonstrated that reduced psoas CSA independently predicted longer LOS, higher complication rates, and increased morbidity following thoracolumbar fusion. 15 Similarly, Hirase et al reported sarcopenia as an independent risk factor for perioperative complications. 16 Ferrero et al also linked poor paraspinal muscle quality to postoperative imbalance in adult deformity.1 11 Conversely, McKenzie et al found no significant effect of sarcopenia on outcomes in less-invasive lumbar procedures, suggesting that the clinical impact of muscle loss may depend on surgical complexity and patient frailty status. 17
When stratified by curve type, our preliminary evidence suggested distinct patterns of muscular involvement. Main thoracic curves (Group 1) showed relatively preserved lumbar muscle morphology and no association with intraoperative metrics or LOS. In contrast, lumbar (Group 2) and double/complex (Group 3) curves were associated with longer LOS (7.2 ± 0.8 and 7.7 ± 1.4 days) compared with Group 1 (6.6 ± 0.8 days, P < .05). These findings suggest that lumbar-predominant deformities may impose greater mechanical stress on the psoas and posterior extensor muscles, promoting localized muscle remodeling.
Consistent with findings from adult populations, muscle morphology in our cohort was also associated with EBL and operative time.13,18,31 Notably, neither BMI nor age correlated significantly with CSA or fatty infiltration (all P > .1), supporting the interpretation that the observed muscular changes are primarily curve-related rather than metabolic or age-dependent. 31
The greater fatty degeneration observed in Group 2 and 3 curves may result from chronic asymmetric loading and compensatory muscular adaptation in the lumbar region.20,21,31 Increased torsional and shear forces in these curves place continuous stress on the multifidus, erector spinae, and psoas muscles, leading to localized disuse atrophy on the concave side and compensatory hypertrophy or fiber-type transformation on the convex side, as described by Stetkarova et al and Becker et al, representing a biomechanical adaptation rather than a degenerative process.21,31 This pattern suggests that even subtle, subclinical alterations in muscle morphology may translate into meaningful differences in early postoperative recovery, despite the absence of systemic frailty or sarcopenia. Consistently, in our cohort, Group 2 patients with greater psoas fatty infiltration experienced delayed ambulation, whereas a higher psoas %FCSA was associated with earlier postoperative mobilization.15,16,18,19 Interestingly, total muscle CSA did not show a clear correlation with perioperative parameters or LOS, suggesting that muscle quality, rather than quantity, is clinically relevant. This finding aligns with the adult data from Urakawa et al and Bokshan et al, who emphasised fatty infiltration and low attenuation as more sensitive predictors of outcome than CSA alone.10,15
The preliminary findings of this study underscore the potential clinical relevance of preoperative muscle assessment in AIS. Although traditional predictors, such as curve magnitude and skeletal maturity, are valuable, MRI-based muscle evaluation may enhance prediction of early postoperative outcomes and could serve as a valid variables in future risk models, similar to the current use of sarcopenia and muscle quality indices in adult spinal deformity surgery.7,32,33,34 In this context, our ROC analyses identified curve-specific preoperative muscle cut-off values associated with increased intraoperative EBL and prolonged operative time, suggesting a potential role for muscle morphology in perioperative risk stratification. Nevertheless, these thresholds should be considered exploratory, and larger prospective studies are still required to validate their clinical applicability and generalizability. Furthermore, the use of the 75th percentile to define “high-risk” outcomes, while pragmatic in the context of a small pilot cohort, is inherently arbitrary, and the modest discriminatory ability of these thresholds (AUC range 0.56-0.75) precludes their direct clinical application without further validation. Furthermore, identifying patients with greater muscle fatty infiltration may allow for tailored rehabilitation protocols, earlier physiotherapy, or targeted prehabilitation strategies aimed at optimizing muscle function prior to surgery, as supported by our finding that Lenke 5 patients with higher muscle %Fat experienced a significantly longer time to ambulation. 35
Strengths and Limitations
This study presents the first investigation evaluating the relationship between preoperative lumbar muscle morphology and early postoperative outcomes in patients with AIS undergoing PSF. The selection of the L3 vertebral level as a standardized reference point is a methodological strength of this study, as it has consistently provided a reliable, anatomically reproducible landmark for cross-sectional assessment of the lumbar musculature in imaging-based morphometric analyses. Muscle measurements were independently performed by two experienced pediatric radiologists, and interobserver reliability was formally assessed using intraclass correlation coefficients, supporting the robustness of the morphometric analysis. The inclusion of detailed, threshold-based muscle parameters represents a methodological strength of this study, as it provides a quantitative framework for assessing muscle quality in this young population, as demonstrated by the higher interobserver reliability of these measures compared with qualitative muscle assessments (MGC). Furthermore, patients were stratified by Lenke curve type, allowing for curve-specific comparisons rather than pooling heterogeneous deformities. This represents a relevant strength, as most existing studies on paravertebral muscle morphology in AIS predominantly focus on isolated thoracic curves and do not systematically evaluate lumbar or complex curve patterns, thereby limiting the understanding of muscle–deformity interactions across the full spectrum of AIS presentations. Furthermore, excluding patients who experienced early postoperative complications helped mitigate potential confounding bias, allowing recovery outcomes to more accurately reflect physiological and functional differences rather than complication-related effects.
However, some limitations should be acknowledged. First, the retrospective and single-center design limits causal inference and external generalizability. Second, the relatively small sample size reduces statistical power and precludes the inclusion of all potentially relevant confounders in multivariable models in order to avoid model overfitting. In particular, subgroup analyses in Group 2 (n = 14) are limited in statistical power, and the multivariable models in this subgroup should be interpreted with caution given the risk of overfitting.Third, although the L3 vertebral level is a widely validated and commonly used reference, it does not account for vertebral rotation and curve-related anatomical distortion inherent to spinal deformity, which may have affected muscle morphology measurements obtained at a single axial level. Moreover, only early recovery variables were analyzed, without long-term functional or radiographic follow-up. Furthermore, potential confounders for early postoperative recovery, such as preoperative physical activity, nutritional status, and postoperative pain levels, were not evaluated and may have influenced functional outcomes, particularly time to ambulation and length of hospital stay. Importantly, the Modified Goutallier Classification showed lower and more variable interobserver agreement compared with quantitative measures, particularly for the psoas and quadratus lumborum muscles. This reduced reliability represents a notable limitation of qualitative muscle grading and should be considered when interpreting findings based on this parameter. Future studies should prioritize quantitative, threshold-based fat quantification approaches, which demonstrated superior reproducibility in this cohort. Finally, although several studies have emphasized the importance of concave–convex asymmetry in AIS, the present analysis used the mean value of both sides to assess overall muscle quality. This approach was chosen to reflect overall muscular condition rather than side-specific adaptation; however, asymmetry-related features may still hold prognostic relevance and warrant further investigation. Additionally, the Lenke-based stratification employed in this study implicitly controls for the major sources of between-group variation in surgical complexity, including number of fused levels, instrumentation density, and curve magnitude, as these parameters are inherently correlated with curve type by definition. Adding them as covariates within already small subgroups would introduce substantial multicollinearity and increase the risk of overfitting, which informed the decision to limit covariate adjustment to age, BMI, and surgery duration.
Conclusion
Preoperative lumbar muscle morphology is associated with perioperative complexity and early postoperative recovery in AIS patients undergoing PSF, in a curve-pattern–dependent manner. Muscle quality showed greater clinical relevance than muscle size and influenced operative duration, blood loss and time to ambulation in lumbar and complex curves, while no associations were observed in isolated thoracic deformities. These findings may support the future integration of MRI-based muscle assessment into preoperative planning and risk stratification algorithms for AIS surgery. However, prospective validation in larger, multicenter cohorts is required to confirm these associations and to establish their clinical applicability across different patient populations and surgical settings.
Supplemental Material
Supplemental Material - Lumbar Muscle Morphology Correlates With Early Surgical Outcomes in Adolescent Idiopathic Scoliosis: A Pilot Study
Supplemental Material for Lumbar Muscle Morphology Correlates With Early Surgical Outcomes in Adolescent Idiopathic Scoliosis: A Pilot Study by Paolo Brigato, Sergio De Salvatore, Davide Palombi, Leonardo Oggiano, Camilla Ravaioli, Michele Inverso, Lorenzo Maria Gregori, Andrea Magistrelli, Gianluca Vadalà, Rocco Papalia, Lisbet Haglund, Jean Albert Ouellet, Pier Francesco Costici, in Global Spine Journal
Supplemental Material
Supplemental Material - Lumbar Muscle Morphology Correlates With Early Surgical Outcomes in Adolescent Idiopathic Scoliosis: A Pilot Study
Supplemental Material for Lumbar Muscle Morphology Correlates With Early Surgical Outcomes in Adolescent Idiopathic Scoliosis: A Pilot Study by Paolo Brigato, Sergio De Salvatore, Davide Palombi, Leonardo Oggiano, Camilla Ravaioli, Michele Inverso, Lorenzo Maria Gregori, Andrea Magistrelli, Gianluca Vadalà, Rocco Papalia, Lisbet Haglund, Jean Albert Ouellet, Pier Francesco Costici, in Global Spine Journal
Footnotes
Ethical Considerations
According to the policies of our institution and in line with national and international regulations (specifically Regulation European Union 2016/679 General Data Protection Regulation, GDPR, Italian Legislative Decree 101/2018, and the guidelines issued by the Italian Data Protection Authority) ethical approval, institutional review board and patient consent are not required for retrospective studies using fully anonymized data collected as part of standard clinical practice.
Author Contribution
Conceptualization, P.B.; methodology, P.B. and S.D.S; validation, L.O., A.M. L.M.G., and PF.C.; formal analysis, P.B. and S.D.S.; investigation, P.B., L.M.G., A.M., C.R., M.I.; data curation, P.B., C.R. D.P. and M.I.; writing—original draft preparation, P.B., S.D.S. and D.P.; writing—review and editing, L.H., J.A.O., R.P., G.V. and P.F.C.; visualization, L.O., L.M.G. and P.F.C.; supervision, L.O., R.P., A.M., J.A.O. and P.F.C.; project administration, P.B. and S.D.S. All authors have read and agreed to the published version of the manuscript.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data Availability Statement
The datasets used and/or analyzed during the current study are not publicly available due to our policy statement of sharing clinical data only on request but are available from the corresponding author on reasonable request.
Supplemental Material
Supplemental material for this article is available online.
Appendix
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
