Abstract
BACKGROUND:
The effective treatment of breast cancer in elderly patients remains a major challenge.
OBJECTIVE:
To construct a nomogram affecting the overall survival of triple-negative breast cancer (TNBC) and establish a survival risk prediction model.
METHODS:
A total of 5317 TPBC patients with negative expression of estrogen receptor (ER), progesterone receptor (PR) and human epidermal growth factor receptor 2 (HER2) who were diagnosed and received systematic treatment from 2010 to 2015 were collected from the American Cancer Surveillance, Epidemiology and End Results (SEER) database. They were randomly divided into training set (
RESULTS:
The C-indices of the nomograms for OS and BCSS in the training cohort were 0.797 and 0.825, respectively, whereas those in the validation cohort were 0.795 and 0.818, respectively. The receiver operating characteristic (ROC) curves had higher sensitivity at all specificity values as compared with the Tumor Node Metastasis (TNM) system. The calibration plot revealed a satisfactory relationship between survival rates and predicted outcomes in both the training and validation cohorts. DCA demonstrated that the nomogram had clinical utility when compared with the TNM staging system.
CONCLUSION:
This study provides information on population-based clinical characteristics and prognostic factors for patients with triple-negative breast cancer, and constructs a reliable and accurate prognostic nomogram.
Keywords
Introduction
The prevalence of breast cancer is age-dependent, such that older age is associated with increased morbidity [1]. The effective treatment of breast cancer in elderly patients remains a major challenge. Older breast cancer survivors contend with health issues and concerns around wellbeing that are related to cancer treatment, the anagen process, and other considerations around illness, treatment, and survivorship. Health concerns among breast cancer survivors include concomitant diseases, osteoporosis, a range of treatment-associated symptoms, effects on physical functioning, concerns around cognitive function, nutritional deficits, and reduced physical activity [2].
Older women are more frequently diagnosed with estrogen-driven tumors, which have a more favorable prognosis and can be effectively treated with targeted hormone therapy [3]. Although the proportion of more favorable breast cancer subtypes increases with age, a high proportion of older patients are still diagnosed with high-risk tumors [4]. According to the International Organization for Geriatric Oncology (SIOG) recommendations, the old Oncology patients were Classified as two age groups for age at diagnosis (
Triple-negative breast cancer (TNBC) is characterized by the absence of estrogen receptor (ER) and progesterone receptor (PR) expression as well as a lack of human epidermal growth factor receptor 2 (HER2) overexpression; this cancer subtype comprises approximately 10–15% of all diagnosed breast cancer subtypes as well as more than one-third of all breast cancer-related deaths [9, 10, 11]. TNBC is difficult to treat and is associated with a poor prognosis resulting from a lack of validated targeted treatments concomitant with the highly aggressive biological behavior of this cancer subtype [12, 13]. TNBC is one of the most invasive subtypes of breast cancer and, as suggested above, one of the most challenging to treat [14]. Based on the clinical complexity of older breast cancer patients and the highly aggressive nature of triple-negative breast cancer, there is a need to create a predictive survival instrument in order to facilitate effective clinical decision-making within this population.
The Surveillance, Epidemiology, and End Results (SEER) program is a cancer registry that covers approximately 30% of the US population. Through the National Data Center associated with SEER, this database contains comprehensive patient information, including demographic and clinical data as well as comprehensive follow-up data. We note that TNBC occurring in older patients is a specific category of triple negative breast cancer.
In this study, we used the SEER database to evaluate clinical characteristics and prognostic factors with regard to TNBC occurring in elderly women. Nomograms have been identified as a novel and reliable tool that incorporate information on demographic and clinicopathological factors in order to make accurate prognostic predictions for many cancer types. Studies have reported on these synthetic and visualized predictive models (i.e., nomogram prediction models) with regard to a wide range of cancers [15, 16, 17, 18]. The objective of the current study was to establish a nomogram accurately and reliably predicting overall survival (OS) and breast cancer-specific survival (BCSS) in elderly patients with triple-negative breast cancer.
Materials and methods
Data collection
The data in this study were obtained from the SEER-linked database, which collects cancer incidence data from 18 population-based cancer registries. SEER-Stat software (version 8.3.9, National Cancer Institute, Bethesda, Maryland, USA) was used to filter and collect information on representative patients (see
Cohort selection
The cases selected in our study were elderly women (
Flowchart describes inclusion criteria and assignment of training and validation cohorts.
The patients were randomly divided into training and validation cohorts at a ratio of 7:3 (training cohort
For validation of the final nomogram, the Harrell concordance index (C-index) was used to verify predictive precision [20]. The value of the C-index can range from 0.5 to 1.0, with 0.5 indicating random chance and 1.0 indicating perfect ability to accurately distinguish outcomes using the model. Prognostic predictive ability was validated through a receiver operating characteristic (ROC) curve used for assessing the area under the curve (AUC), constructing calibration curves, and conducting decision curve analysis (DCA). We plotted Kaplan-Meier curves based on the tertile score stratifications predicted from the model and conducted comparisons using the log-rank test.
The cutoff values for the predicted tertile score stratifications were determined using X-tile software (version 3.6.1; Yale University School of Medicine, New Haven, CT, USA) [21]. The elementary cohort was randomly separated into training and validation cohorts using R statistical software (version 4.1.2,
Results
Clinicopathological characteristics in the training and validation cohorts
A total of 5,317 women aged 70 years or older who were diagnosed with triple-negative breast cancer between 2010 and 2015 were selected from the SEER database and were randomly separated into two cohorts, a training cohort (
The demographic and baseline characteristics of the Cohort with 70 years or older with triple-negative breast cancer
The demographic and baseline characteristics of the Cohort with 70 years or older with triple-negative breast cancer
Univariate and multivariate analysis with Cox regression for OS
Nomograms for predicting 1-, 3-, and 5-year OS (A) and BCSS (B) for patients with the prognosis factors. The total points are calculated by summing up the points for each factor. The predicted probabilities of OS and BCSS can be obtained by projecting the location of the total points to the bottom scales. OS, overall survival; BCSS, breast cancer-specific survival. 
Univariate and multivariate analysis with cox regression for BCSS
Univariate and multivariate Cox proportional hazard regression analyses were used to explore prognostic factors for OS and BCSS. Following this, all factors showing statistical significance in the univariate analysis were subjected to multivariate analysis within a Cox regression model The factors independently associated with OS in the multivariate analysis were age, grade, T stage, N stage, surgery, radiation, chemotherapy, bone metastases, lung metastases, liver metastases, and brain metastases (all
R statistical software was applied to develop a nomogram for OS and BCSS (i.e., in order to predict patient prognoses), and all of the major variables specified above were used to establish the nomograms for OS and BCSS. Nomograms constructed based on the final Cox models were used to predict survival probabilities (i.e., 1 -, 3 -, and 5-year OS and BCSS) for older patients with TNBC (Fig. 2). By aggregating the total scores from all the aforementioned variables and casting these scores to the total point scale, we were able to predict outcome probabilities for OS and BCSS in older patients with TNBC.
Calibration and validation of the prognostic nomograms
C-index values, calibration curves, ROC curves, and DCA curves were used to identify the efficacy of the predictive nomograms. In the training cohort, the C-index for the OS nomogram was 0.797 (95% confidence interval [CI], 0.784–0.810) and the C-index for the BCSS nomogram was 0.825 (95% CI, 0.811–0.840). In the validation cohort, the OS C-index for the nomogram was 0.795 (95% CI, 0.775–0.814) and the BCSS nomogram index was 0.818 (95% CI, 0.795–0.841). These results indicate that the nomograms were repeatable and accurately predicted both OS and BCSS.
Calibration curves for the 1-, 3-, and 5-year. (A, B, C) calibration curves for OS in the training cohort; (D, E, F) calibration curves for OS in the validation cohort; (G, H, I) calibration curves for BCSS in the training cohort; (J, K, L) calibration curves for BCSS in the validation cohort. OS, overall survival; BCSS, breast cancer-specific survival. 
ROC curves for the 1-, 3-, and 5-year. (A, B, C) ROC curves for BCSS in the training cohort; (D, E, F) ROC curves for OS in the validation cohort; (G, H, I) ROC curves for BCSS in the training cohort; (J, K, L) ROC curves for OS in the validation cohort.OS, overall survival; BCSS, breast cancer-specific survival,ROC, receiver operating characteristic curve; AUC, areas under the ROC curve. 
Moreover, in both the training and validation cohorts, the calibration curves of nomograms predicting 1-, 3-, and 5-year OS and BCSS as well as the probability of recurrence indicated that the model had good predictive value (Fig. 3). The 1-, 3-, and 5-year ROC results showed that the sensitivities and specificities of the OS and BCSS nomograms were higher than those of the AJCC TNM staging system in both the training and validation cohorts (Fig. 4). The 1-year, 3-year and 5-year AUC values for the validation cohort OS nomograms were improved as compared with prognostic evaluations conducted using the TNM staging system (86.8%, 81.7%, 79.4% vs. 82.3%, 76.0%, and 72.6%, respectively). In addition, in the validation cohort, the predictive performance of nomograms in terms of BCSS was superior to that of TNM staging (1-year BCSS: 88.0% vs. 85.9; 3-year BCSS: 83.9% vs. 81.1%; 5-year BCSS: 81.5% vs. 78.6%). Moreover, the DCA results suggested that the prognostic nomograms constructed herein had high net benefits with regard to clinical decision-making (Fig. 5).
DCA curves for the 1-, 3-, and 5-year. (A, B, C) DCA curves for BCSS in the training cohort; (D, E, F) DCA curves for OS in the validation cohort; (G, H, I) DCA curves for BCSS in the training cohort; (J, K, L) DCA curves for OS in the validation cohort. DCA, decision curve. 
Survival of older TNBC patients according to different risk groups. In the validation cohort, OS (A) in nomogram-based low-, moderate- and high-risk subgroups; BCSS (B) in nomogram-based low-, moderate- and high-risk subgroups. In the training cohort, OS (C) and BCSS (D) in nomogram-based low-, moderate- and high-risk subgroups, respective. DCA, decision curve analysis; TNBC, triple-negative breast cancer; OS, overall survival; BCSS, breast cancer-specific survival. 
To evaluate the discrimination capability of the model, we stratified by tertiles of the predicted probability scores within the nomograms and then plotted these findings as Kaplan-Meier curves for OS and BCSS (Fig. 6). According to the tertile cut-off values for the OS nomogram, the dataset was divided into low-risk (score
In terms of BCSS, the low-risk group had the highest BCSS rates of 97.9% at 1 year, 90.3% at 3 years, and 85.9% at 5 years, followed by the moderate-risk group (89.1%, 68.3%, and 56.7%) and the high-risk group (49.3%, 20.4%, and 8.8%). Similar results were observed in the validation cohort. Survival curves showed that risk stratification differentiated well between OS and BCSS in all subgroups (
Discussion
TNM staging is the most commonly implemented breast cancer staging system and is used as a basic prognostic factor within prognostic evaluations. Nomograms have been shown to provide a clearer prediction of prognoses as compared with TNM staging for some cancers [22, 23, 24]. It has been documented that nomograms play a key role in the prediction of breast cancer patients and can effectively distinguish the tumor area of breast cancer patients and can help in the diagnosis of breast cancer [25, 26]. In the present study, the derived nomograms were validated The C-indices of the OS and BCSS nomograms were statistically significantly higher in both the training and validation cohorts, showing better discrimination ability and improved results as compared with those reported in previous studies [27]. ROC analysis indicated that the novel nomogram presented better sensitivity and specificity with regard to predicting OS and BCSS in older patients with TNBC. In addition, calibration curve and DCA curve evaluations demonstrated good consistency as well as a favorable net clinical benefit with regard to the evaluated model. Several studies have shown that the accuracy of prognostic models can be statistically significantly improved by integrating AJCC staging with other clinical prognostic indicators [28], and T-staging and N-staging each played an important role in our nomogram drawings.
As a pivotal prognostic factor for breast cancer, distant metastasis sites are closely associated with poor survival outcomes. Our results suggest that bone metastasis, lung metastasis, and brain metastasis are independent risk factors that can increase the scope and accuracy of predictive nomograms. Bone metastasis is the most common type of metastasis occurring in breast cancer.
Kono et al. demonstrated that breast cancer patients with single bone metastases exhibited a longer OS than those with other single-organ metastases [29]. Moreover, the median survival time of metachronous patients following bone metastasis was 7.54 years in metachronous patients [30]. Lung metastases occurring in BC have been recorded as the second most commonly occurring metastasis in breast cancer, and are associated with a median survival time of 22 months after treatment [31]. The median survival time of BC patients exhibiting untreated metastasis to the liver is only approximately 4–8 months [31]. As compared with other common cancer types, brain metastasis occurring within breast cancer is rare, and the median associated survival time is only approximately 4 to 6 months [32].
In this study, the hazard ratios for OS and BCSS exhibited an increasing trend from the 70–74 year subgroup to the 75–79 year, 80–84 year, and 85+ year subgroups. Other research groups have reported that BCSS is not as severe as OS in the subgroup of elderly breast cancer patients aged
Regarding treatment factors, we found that surgery, chemotherapy, and radiotherapy were independent prognostic factors for the progression and survival of older women with triple-negative breast cancer. However, chemotherapy did not lead to improved BCSS within our study. As shown in our model, among older TNBC patients, surgical resection of primary tumors was found to statistically significantly prolong both OS and BCSS. Some researchers have shown that the addition of chemotherapy to treatment regimens for older patients with TNBC offers substantial benefits in terms of overall survival [35]. However, chemotherapy was not an independent prognostic factor for BCSS in our study, consistent with the findings of a previous investigation [36]. Studies have indicated that even healthy older women (who are recommended chemotherapy) do not tolerate this treatment as well as younger women, demonstrating an increased likelihood of potential side effects, hospitalizations, and short-term mortality [37, 38]. Studies have shown that older patients with T1-2N0 triple-negative breast cancer treated with postoperative radiotherapy might show better overall survival in studies controlling for different baseline clinical characteristics and employing propensity score matching [39].
Our data indicate that the novel nomograms developed herein have better prognostic value and predictive accuracy as compared with the TNM staging system in terms of evaluations in older patients with TNBC. Therefore, we conclude that the nomograms delineated in this study provide a more accurate individualized survival assessment for elderly patients with triple-negative BC.
Limitations
This study has several limitations. Firstly, detailed information on systemic treatment was not collected, especially with respect to specific chemotherapy regimens, radiotherapy regimens, targeted therapies, and hormone therapy. Secondly, the SEER database does not provide information on Ki-67, BRCA1/2 gene mutations, or polygenic status, all of which are important genetic factors as well as strong prognostic parameters with regard to BC trajectories [40]. Thirdly, this was a retrospective epidemiologic investigation, and selection bias may therefore be unavoidable. Fourth, we lacked external data validation with another independent large-scale dataset, but performed effective internal verification.
Conclusion
Based on a large population recruited from within the SEER database, we established nomograms predicting survival outcomes in elderly patients with triple-negative BC. These validated nomograms provide a visual prognostic assessment for each prognostic factor and will help physicians predict 1-, 3-, and 5-year OS and BCSS in elderly patients with triple-negative BC. Our findings inform future research direction as well as directly informing medical guidelines.
Ethics statement
Since the data were obtained from the SEER database, informed consent and ethical approval were not required.
Funding
This work was supported by the National Natural Science Foundation of China (No. 8217103088).
Author contributions
RF and BL contributed to the study design, data collection, and manuscript writing. XW and WH conceptualized the study and was involved in result interpretation and manuscript writing. RF and DL statistical analyzed the data. JZ, YY and XC contributed with a critical revision of the manuscript. All authors contributed to the article and approved the submitted version.
Data availability statement
Publicly available datasets were analyzed in this study. This data can be found in the Surveillance, Epidemiology, and End Results (SEER) database (
Footnotes
Acknowledgments
Not applicable.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
