Abstract
We present the first analysis examining molecular alterations detected utilizing a clinically available cell-free circulating tumor DNA (cfDNA) assay in a cohort of patients with advanced colorectal cancer (aCRC) diagnosed <50 versus ≥50 years of age. Patient characteristics and mutation frequencies were compared using cfDNA tests from 5873 patients. Patients <50 had more sequence alterations in APC, CTNNB1, and SMAD4, as well as a higher frequency of focal BRAF and ERBB2(HER2) amplifications. Our study adds further evidence suggesting that young- versus older-onset CRC may have distinct molecular underpinnings, with prognostic and therapeutic implications.
Background
Colorectal cancer (CRC) occurrence has declined over the past several decades; however, in younger individuals, there has been a notable increase in incidence.1–3 Younger patients are more likely to present with advanced disease, potentially due to lack of screening, delays in diagnosis, or more rapidly progressing disease.4,5 While data are conflicting, there is some evidence that younger patients may have worse prognosis and outcomes.4,5
A minority of patients with young-onset CRC have a hereditary syndrome, and while modern lifestyle and dietary trends have been identified as potential risk factors, an established cause for this increase has yet to be determined.6–8 Previous studies using tissue-based approaches have compared the frequency of somatic alterations in patients with young- versus older-onset CRC and found significant, yet inconsistent differences between cohorts, suggesting that young- versus older-onset CRC may have distinct molecular and clinical features.6,8–10 We aimed to describe differences in cell-free circulating tumor DNA (cfDNA) in CRC patients by age to further elucidate molecular differences between young- versus older-onset advanced CRC (aCRC).
Methods
We retrospectively analyzed a deidentified database containing results from consecutive patients treated within the United States with aCRC (stage IIIB or higher) who underwent clinical testing with a validated, next-generation sequencing (NGS) cfDNA assay (Guardant360®, Guardant Health) between October 2015 and March 2019.11,12 Patient age, sex, and cancer type were extracted from test request forms (TRF). If a patient had longitudinal testing, only their first chronological test was included. Since date of diagnosis was not consistently provided, we categorized patients by their age at the time of testing. Using the subset of samples with the age of diagnosis included, we assessed how many patients were incorrectly categorized as being in the ≥50 cohort when they were actually diagnosed at age <50. Using this same subset, we analyzed how many samples were drawn ≤6 months versus >6 months after the patient's diagnosis.
During the study period, the assay, which has been previously described, evolved from a 70- to a 73-gene panel (Supplementary Table S1). 12 Microsatellite instability high (MSI-H) status was available for a subset of cases, upon validation of its inclusion in the assay. 13 During the study period, the assay's bioinformatics pipeline implemented an aneuploidy distinction feature which identifies focal gene amplifications as those with a statistically higher copy number relative to other genes across the chromosome arm (see Supplementary Table S1 for the list of genes in which this differentiation can be made).
Clinical characteristics and mutation frequencies were compared between the <50 and ≥50 age groups, after excluding variants of uncertain significance and synonymous variants. Each gene was assessed by alteration type, including sequence alterations (single-nucleotide variants [SNV]/indels/splice site variants), amplifications, and fusions, with frequencies determined on a per-patient basis (patients with multiple sequence alterations in the same gene were counted once). Only patients with focal amplification determination data available were included in the denominator for calculation of amplification frequency. Sequence alterations were combined for the pathway analysis and pathway-level frequency was calculated using the total number of functional sequence alterations in that pathway divided by the overall number of sequence alterations identified (see Supplementary Table S2 for the genes included in each pathway). The maximum variant allele frequency (maxVAF) represents the tumor-derived alteration with the highest level of cfDNA in the sample (calculated based on the number of tumor-derived DNA molecules at each location divided by the total number of unique cfDNA molecules at the given nucleotide position). For select factors, subanalyses comparing patients aged <40 versus ≥40, patients aged <35 versus ≥70, men aged <50 versus women <50, and women aged <50 (premenopausal) versus women ≥50 were completed and these results are detailed in the Supplementary Data. Demographic and clinicopathological characteristics were compared between cohorts. Fisher's exact test was used to compare categorical variables and Wilcoxon rank sum tests were used for continuous variables.
This research was approved by the Quorum Institutional Review Board (IRB) for the generation of deidentified data sets for research purposes.
Results
During the study period, 5873 unique patients with aCRC underwent cfDNA NGS. The median age at the time of cfDNA blood draw was 60 years (range: 18–96). The majority of patients were male (56.1%), and there were significantly more men in the ≥50 cohort compared to the <50 cohort (57.0% vs. 52.7%; p = 0.0064; Table 1).
Patient Demographic and Sample Information
Bold values indicate p < 0.05.
Patient demographic and sample information for N = 5873 total unique tests from advanced colorectal cancer patients submitted for clinical testing via a cell-free circulating tumor DNA assay Between October 2015 and March 2019. Fisher's exact test was used to compare the frequency of male and female patients and the frequency of MSI-H samples. Wilcoxon rank sum tests were used to compare the median maxVAF and median number of alterations per sample.
cfDNA, cell-free circulating tumor DNA; maxVAF, maximum variant allele frequency; MSI-H, microsatellite instability high.
At least one cfDNA alteration was identified in 88.1% (n = 5174) of patients. While the median maxVAF was significantly higher in patients <50 (6.97 vs. 4.22; p < 0.0001), there was no significant difference in the median number of alterations per sample between cohorts (5 vs. 5; p = 0.5202). Microsatellite testing results were available for 2611 patients (51% of patients with ≥1 cfDNA alteration detected), and the frequency of MSI-H was higher in the <50 cohort, but not statistically significant (5.0% vs. 3.7%; p = 0.1646). BRAF V600E alterations were significantly more frequent in MSI-H patients who were ≥50 (36.4% vs. 7.7%; p = 0.0055; Supplementary Table S3).
Next, we examined the frequency of sequence alterations by gene, focusing on the 24 most frequently mutated genes in this cohort (Supplementary Fig. S1). Sequence alterations in TP53, APC, KRAS, and PIK3CA were the most frequent in both cohorts, consistent with previous molecular studies of CRC.10,14 There were significantly more alterations in ATM and TERT (promoter mutations only) in patients ≥50, and significantly more alterations in APC, SMAD4, ARID1A, PTEN, and CTNNB1 in patients <50 (Fig. 1A and Supplementary Fig. S2A, B). There was no significant difference in the frequency of gene fusions for any of the six genes analyzed on the cfDNA assay (Fig. 1B and Supplementary Fig. S2C, D). In total, 12.5% (2/16) of the BRCA1 alterations and 11.4% (4/35) of the BRCA2 alterations detected in patients age <50 were suspected germline, compared with 9.2% (6/65) of the BRCA1 alterations and 2.9% (3/104) of the BRCA2 alterations in patients age ≥50 (Supplementary Table S4). While numbers were limited for subanalyses of men versus women aged <50 (n = 549 and n = 495, respectively) and women aged <50 versus ≥50 (n = 495 and n = 1750, respectively), there were significant differences in the per-patient frequency of sequence alterations in notable genes, including, BRAF, KRAS, and APC (Supplementary Fig. S3A, B).

Comparison of frequency of alterations between advanced colorectal cancer patients aged <50 and ≥50.
Amplification data from 1771 patients (34%) underwent analysis using the aneuploidy distinction feature. Focal amplifications in BRAF, CCND1, and ERBB2(HER2) were significantly more frequent in patients <50 (Fig. 1C and Supplementary Fig. S2E, F).
Overall, the frequency of functional sequence alterations by pathway was largely the same between cohorts, however, there was a significantly higher frequency of alterations in the SWItch/Sucrose Non-Fermentable (SWI/SNF) pathway in patients aged <50 and a numerically higher number of alterations in the RAS/RAF/MAPK pathway in patients aged ≥50 (Fig. 2).

Frequency of sequence alterations by pathway. Numerator represents total number of functional alterations in the genes included in that pathway and denominator represents total number of sequence alterations identified in the corresponding age cohort (variants of uncertain significance and synonymous variants excluded from the numerator). Color images are available online.
In total, 41.4% of samples included the patient's age of diagnosis. Using these samples, we found that 6.3% of patients switched from the ≥50 to the <50 cohort when incorporating age of diagnosis (Supplementary Fig. S4). We also found that the majority of samples (74.4%) were drawn >6 months after the patient's diagnosis.
Discussion
There has been an increase in the incidence of young-onset CRC, with more individuals diagnosed with late-stage disease.15–17 Delays in diagnosis do not fully explain this difference, suggesting that there may be undiscovered clinical and/or molecular distinctions between young- and older-onset CRC, and thus examination of somatic genetics may provide insights into the underlying biology of these cancers.15,16 Previous research examining molecular differences between young- and older-onset CRC patients using tissue testing included variable age cutoffs, various stages, and have not consistently differentiated between mutation types (a summary of findings from these studies can be found in Supplementary Table S5). To our knowledge, this is the first study to comprehensively assess potential differences in genetic alterations identified via cfDNA in aCRC patients aged <50 versus ≥50 and details significant molecular differences between young- and older-onset aCRC patients, most notably in the RAS/RAF/MAPK pathway.
Activating alterations in KRAS, NRAS, and BRAF are predictive biomarkers of resistance to anti-epidermal growth factor receptor (EGFR) antibody therapy in both the primary and acquired resistance settings. 18 Current National Comprehensive Cancer Network (NCCN) guidelines recommend molecular testing for RAS genes and BRAF in all aCRC patients, and also include several recommended systemic therapy regimens involving BRAF ± MEK inhibitors for patients with BRAF V600E. 19 We found that KRAS alterations were significantly more frequent in patients ≥40, which is consistent with several previous studies, and sequence alterations in the RAS/RAF/MAPK pathway were numerically more frequent in patients ≥50.6,10 Interestingly, we found that focal BRAF amplification was significantly more frequent in patients aged <50.
Emerging data suggest that ERBB2(HER2) amplification may be a negative predictor of response to anti-EGFR therapy, and there are several ongoing clinical trials of ERBB2(HER2)-targeted therapy in ERBB2-amplified, RAS/RAF wild-type aCRC.20–22 Thus, the identification of more frequent ERBB2(HER2) amplifications in young-onset aCRC patients may have important implications in terms of therapy selection for these patients as well as patient selection for clinical trials.
In patients with microsatellite stable tumors, Lieu et al. found APC alterations to be more common in patients >50, while Willauer et al. found fewer APC alteration in patients aged 18 to 29 compared with other young-onset patients. In our analysis, we found the lowest frequency of APC alterations in patients <30 (47.3%) and the highest frequency in patients 40–49 (62.8%) and 50–59 (61.9%), suggesting that APC may play less of a role in very young-onset aCRC. Notably, CTNNB1 was significantly more common in patients <50, consistent with both Willauer et al. and Lieu et al.6,10
SMAD4, a member of the TGF-β pathway, has recently been associated with worse outcomes and, potentially, a predisposition to chemoresistance, particularly to 5-fluorouracil (5-FU)-based therapy. 23 While further research is needed to validate this work, our finding that SMAD4 alterations were significantly more frequent in patients <50 has potential future implications for therapy selection in these patients.
While there was no difference seen in the median number of alterations detected, the median maxVAF was significantly higher in patients <50. Overall, higher maxVAF has consistently been associated with a more significant disease burden, and higher cfDNA maxVAF has been associated with worse prognosis. 24 However, further studies are needed to better understand which factors may impact tumor cfDNA shed and to validate the prognostic value of this marker.
A common concern with cfDNA is potential contamination by alterations stemming from clonal hematopoiesis (CH), which can be difficult to differentiate from tumor-derived variants. To evaluate the potential impact of CH, we examined the frequency of JAK2 V617F in this cohort, as this alteration is highly characteristic of myeloproliferative neoplasms and is rare in CRC, and found that a small minority (0.6%) of patients had this alteration. 14 Of note, many of the genes with the highest rates of CH are not included on this cfDNA assay, and several studies have found low rates of CH in the genes currently actionable in CRC (e.g., BRAF, KRAS). 25
We acknowledge that this study has several limitations, predominantly related to the fact that it is a retrospective study and our knowledge of patient clinical history, including treatment regimens, is limited to what is provided by the ordering provider on the TRF. Using the 41.4% of samples that included the patient's age of diagnosis, we found that a small minority of patients (6.3%) were incorrectly classified as aged ≥50, and that the majority of samples (74.4%) were drawn >6 months after the patient's diagnosis, suggesting that these patients likely received prior therapy. While previous work has shown that the concordance between tissue and cfDNA analysis in treatment-naive aCRC patients is high when examining driver alterations, it seems likely that some of the alterations detected in this cohort (e.g., KRAS SNVs, BRAF amplifications) may represent acquired resistance alterations. 26 Data support the ability of cfDNA to identify the evolution of a tumor across time and treatment, and thus account for both spatial and temporal heterogeneity.14,20,26 Thus, cfDNA testing provides a unique perspective of aCRC, including the identification of intriguing potential differences in acquired resistance mechanisms between young- and older-onset aCRC.
In summary, this study highlights the utility of cfDNA analysis in assisting in the implementation of precision oncology by identifying molecular alterations with potential therapeutic implications in aCRC, and adds further evidence to suggest that there are distinct molecular differences between young- and older-onset CRC, which are relevant to emerging and currently available targeted therapies. Thus, the one size fits all models for CRC treatment may need to be reconsidered, and future work is needed to understand how these distinct molecular characteristics may better inform treatment of these patients.
Footnotes
Acknowledgments
The authors acknowledge Richard B. Lanman, Rebecca J. Nagy, Victoria M. Raymond, and Jennifer Saam for their criticial review of analyses and feedback on the article.
Author Disclosure Statement
C.M.W., C.R.E.—stockholders and employees of Guardant Health; A.B., H.-J.L. —No disclosures.
Funding Information
No funding was received.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
