Abstract
Regulators and industry are actively seeking improvements and alternatives to current models and approaches to evaluate potential carcinogenicity of gene therapies (GTs). A meeting of invited experts was organized by NC3Rs/UKEMS (London, March 2023) to discuss this topic. This article describes the consensus reached among delegates on the definition of vector genotoxicity, sources of uncertainty, suitable toxicological endpoints for genotoxic assessment of GTs, and future research needs. The collected recommendations should inform the further development of regulatory guidelines for the nonclinical toxicological assessment of GT products.
INTRODUCTION
The therapeutic potential of gene therapies (GTs) to address hitherto untreatable conditions has led to a rapidly increasing number of candidates entering the clinic, with more than 16 market authorizations and thousands of patients treated with GTs. 1 The regulatory framework for the assessment of GT products comprises both regulations and guidance documents that cover specific jurisdictions, and requiring the interpretation of these documents by regulators and sponsors through experience gained in practice. 2 –7 Approvals are given on a case-by-case basis and involve submission of a portfolio of evidence to support each investigational drug application before initiating clinical studies and throughout the clinical development process.
For the purposes of this article, GTs were defined as ex vivo or in vivo therapies that modify the genome with gamma-retroviral (γRV), lentiviral (LV), or adeno-associated viral (AAV) vectors or DNA transposons (e.g., PiggyBac and Sleeping Beauty). Genome editing (e.g., by CRISPR or zinc finger nucleases) or nanoparticles as delivery method are not covered. Similarly, we focused on autologous therapies (influenced by patient/disease context) as opposed to allogeneic treatments where a cell therapy is manufactured from a healthy donor, removing the influence of the preexisting patient genome alterations. In contrast, allogeneic products have an increased exposure risk from any manufacturing-driven genotoxic event, since a higher number of patients are treated from the same product batch.
Concerns about the risk of carcinogenicity (this term is considered equivalent to tumorigenicity or oncogenicity for the purposes of this article) associated with the use of integrating viral vectors were first raised in the early 2000s, when clinical trials of ex vivo GT for primary immune deficiencies were put on hold in the United States and France following several cases of leukemia. 8 –10 Since then, further clinical cases of leukemia/lymphoma (and myelodysplasia) definitively linked to vector integration, as well as cases of uncertain relationship to vector integration, have occurred. 11 –15 In 2023, the U.S. Food and Drug Administration (FDA) announced an investigation of the identified risk of T cell malignancy following BCMA-directed or CD19-directed autologous chimeric antigen receptor (CAR) T cell immunotherapies. While noting that the overall benefits of these products continue to outweigh their potential risks, in 2024, a class-wide black box warning in their labels was issued. 16,17 One of such secondary malignancies has been recently reported. 18 Levine et al. responded to the announcement of the investigation, highlighting that while safety concerns should be thoroughly investigated, existing data from follow-up studies suggest that the risk of T cell malignancies remains low when compared with existing cancer treatments. 19
Furthermore, various RV and LV, as well as AAV vectors, in vivo and/or ex vivo, have been associated with neoplasia in mouse models or with nononcogenic clonal expansion in dogs. 20 –23 These data exemplify the potential for carcinogenicity associated with various GT vectors and have led to increased regulatory oversight and safety precautions as well as research to produce safer vectors, even though carcinogenicity has not been reported following the use of AAV vectors in large-animal studies or in humans.
Guidance about genotoxicity (via insertional mutagenesis) and carcinogenicity risk assessment is generic, with no specific assays proposed. The traditional ICH (International Council for Harmonisation of Technical Requirements for Pharmaceuticals for Human Use) S1 test battery and life-time animal bioassays were designed for small molecules and are generally not applicable and insufficient for detection of potential carcinogenicity in humans for GTs. Therefore, more tailored approaches are actively encouraged, and a weight of evidence approach is advocated using all available data, both product-specific and from similar or related products. However, there is still no scientific consensus on the most appropriate approaches or models to assess the risk of insertional mutagenesis and potential carcinogenicity of vector-mediated GTs for regulatory purposes. While both in silico and in vitro models are recommended in guidelines, and regulatory authorities state that animal studies should be reduced and avoided where possible, small-animal in vivo studies are still often used despite concerns over their relevance. 24
There is a clear need for the development and use of approaches that can reliably detect potential carcinogenicity of GTs in humans. Ideally, these assays should be predictive, human-relevant, fast, and cost-efficient, have reduced reliance on animals, and, where possible, be amenable to use in the clinic.
To action these concerns and to address the clinical need to accelerate the approval rate of investigational GT products, a group of invited expert scientists from industry, academia, and regulatory authorities met to explore the principles and open questions on genotoxicity and carcinogenicity hazard identification, risk assessment, and risk mitigation through nonclinical testing. Scientists were selected to cover expertise in different vector types, genotoxicity assessment, in vitro and in vivo genotoxicity models, regulatory and nonclinical risk assessment of gene therapy products. A set of consensus statements were drafted by a small coordinating group before the meeting of the expert group where the statements were amended and finalized. The intention was to achieve a high-level constructive, scientifically acceptable framework that could be agreed by all participants. The final report highlights areas of agreement, as well as areas where a consensus could not be achieved, and is intended for wide dissemination within the relevant scientific and regulatory communities to facilitate further discussion and inform new assay development.
In this article, we describe the scientific and regulatory background that led to this discussion, and present the consensus set of principles reached among scientists during a two-day workshop held in London on March 7 and 8, 2023, hosted by the U.K. National Center for the Replacement Refinement and Reduction of Animals in Research (NC3Rs) and the U.K. Environmental Mutagen Society (UKEMS). Representatives from the FDA CBER/OTP, MHRA, PMDA and (CBG-)MEB attended the meeting in a mainly observational capacity and provided high-level comments. The participation of regulatory agency representatives does not necessarily reflect the position of their respective agencies.
SCIENTIFIC BACKGROUND—EVIDENCE FOR CARCINOGENIC POTENTIAL
Before the initiation of GT clinical trials with long terminal repeat (LTR)-driven γRV, the potential for carcinogenesis was recognized, but the risk was considered to be low. 25 Yet, cases of vector-induced leukemia were observed even in small clinical trials, when using ex vivo hematopoietic stem cell GT with LTR- driven γRV. These included trials for X-linked severe combined immune deficiency (X-SCID), Wiskott–Aldrich syndrome (WAS), and adenosine deaminase deficiency severe combined immune deficiency (ADA-SCID). 8 –10,13 In addition, 3 out of 12 patients treated for chronic granulomatous disease (CGD; also ex vivo GT using LTR-driven γRV) and 3 out of 64 patients treated with Skysona/eli-cel (ex vivo GT using a self-inactivating LV vector (SIN LV) with strong internal retroviral MNDU3 promoter), all with long-term engraftment, showed myelodysplastic syndrome linked to activating retroviral integrations. 11,12,26 The latency to clinical evidence of leukemia development was between 9 months and 15 years after γRV GT, with an incidence varying from one out of more than 70 treated (ADA-SCID) to seven of 10 treated (WAS) patients. 8,27
A key mechanism resulting in vector-induced carcinogenesis was determined to be enhancer-mediated activation of endogenous proto-oncogene promoters by the LTR region of the γRV-vectors or by the strong retroviral MNDU3 promoter in the internal position of an SIN LV vector. 10,12,28 However, other mechanisms (such as disruption of tumor suppressor genes or of genomic loci leading to aberrant splicing and/or transcriptional termination) were likely involved as well in some cases. 26,29,30 It is worth noting that clonal expansion (in the absence of overt carcinogenesis) has also been observed in both clinical trials and in nonclinical studies, driven, for instance, by aberrant splicing events. 31,32 These events were observed even with SIN LV vectors. Carcinogenesis has also been observed in clinical trials with differentiated lymphocytes transduced with DNA transposons. Two cases of iatrogenic T cell lymphoma were observed in the CARTELL clinical trial, where CAR-T cells had been transduced with a nonviral piggyBac system. There was no integration into known oncogenes and the molecular mechanisms behind the lymphoma remain unclear. However, a high transgene copy number, an altered genomic copy number, and point mutations unrelated to the integration sites were all considered a contributory factors. 14,15 DNA transposons are mobile elements identified from Baculovirus, such as piggyBac or extinct fish such as Sleeping Beauty. 33,34 DNA transposons tend to integrate at TA or TTAA sequence motifs, but recent publications show lower frequency alternative integrations, with only one inverted repeat and with a distinct target sequence pattern. 35 –37 As long as the transposase remains active, multiple cycles of excision and integration are possible, increasing the risk of DNA transposon insertional mutagenesis. 29 Several mutant transposases have been developed to prevent secondary reintegrations or to modify their integration pattern. 35,36
Neoplasia has also been observed in animal models, using various GT vectors and routes of administration. For example, LTR-driven γRV (ex vivo) vectors and SIN LV vectors equipped with retroviral promoters (ex vivo) have been associated with neoplastic events in mouse models. 20,21 Neoplastic events in nonclinical models were primarily observed in tumor-prone mouse models administered early-generation LTR-driven γRV or LV with retroviral promoters that lack enhanced safety elements designed to reduce the risk of neoplastic transformation, such as the use of a four-plasmid production system and SIN LTR sequences. These modifications were introduced in the third-generation LV vectors currently in use in the clinic, improving their safety profile. 1,21
Despite the low integrative potential of recombinant AAV (rAAV) vectors, liver tumors have been observed in mice following dosing as neonates; tumors occurred after 1 year or later (being a significant proportion of their life span, compared with the follow-up in human patients given AAV GT). 22,38,39 Dosing of adult mice with rAAVs, in the presence of liver damage, also resulted in liver tumors. 40 AAV integration has also been found in liver tumors in adult Ornithine transcarbamylase (OTC) mice; however, tumor frequency was not elevated compared with controls, so the causal link to AAV insertional mutagenesis is unclear. 41 Oncogenic transformation due to rAAV has not been observed in healthy adult mice or any other species. In dogs, nononcogenic clonal expansion in the liver was recently reported in long-term studies in hemophilia A dogs treated with rAAV vectors. 23 Expanded clones showed integration in genes potentially associated with cell growth control. In addition, integrated vectors were generally truncated and the transgene commonly deleted. For unknown reasons, increases in Factor VIII were observed over time in some dogs. In another hemophilia dog study (using a distinct rAAV), clonal expansion was detected in some animals, and integrated vectors were also found to be extensively deleted and rearranged. The gene CCND1 was notably marked by integration in both studies. 42 In these two studies, no adverse consequences of clonal expansion were reported. It is known that clonal expansion is commonly associated with aging in dogs, which may have contributed to some of the observed cell proliferation. 42,43 Another study of two dogs treated with AAV for hemophilia A showed a multicentric lymphoma in one animal after AAV gene therapy. However, the tumor was not vector marked, indicating that integration did not contribute to transformation. Both dogs showed diverse vector integrations and, in a few cases, modest clonal expansion, but here too there was no evidence for genotoxicity based on AAV integration. 44 The American Society of Gene and Cell Therapy hosted a virtual roundtable on AAV integration in 2021, and its summary white paper reviewed the evidence of rAAV integration in animal models and the possible risks of insertional mutagenesis in patients. 45 Lastly, a recent very comprehensive study characterized the location, abundance, and expansion of rAAV integrations in the liver of a large cohort of nonhuman primates up to 15 years postdosing. These were compared with the same endpoints in nonhuman primates (and humans) naturally exposed to wild-type AAVs (wtAAVs). Although both rAAV and wtAAV showed a higher frequency of integration sites in regions susceptible to DNA damage or near highly transcribed genes, only a small proportion of rAAV-treated animals showed liver clonal expansions without signs of tumorigenicity and slightly lower than the expansions observed for wtAAV infections in the same species and in humans. 46 It should also be noted that rAAV integrations have also been detected in extrahepatic tissues, for example, the heart. 46,47
rAAV vectors have been administered to over 3000 patients in more than 200 clinical trials with no cases of carcinogenesis attributed to the rAAV vector. 1,45,48,49 For instance, no tumor development has been linked to insertional mutagenesis in over 3000 children dosed with Zolgensma (an rAAV-based therapy for spinal muscular atrophy) with up to 7 years follow-up, whereas liver tumors were observed in mice 20 months after neonatal dosing with a Zolgensma-like rAAV. 39,50 Although controversial, one group has shown that integrated wtAAV2 is observed in a rare subset of human hepatocellular carcinomas (HCCs) with increased RNA expression in nearby oncogenes. 51 –53 The relationship of wtAAV2 integration with altered RNA expression and HCC has not been definitively established and any implications of this observation for rAAV vectors are presently unclear. However, a plausible mechanism, linking a 3′ UTR enhancer—promoter element in wtAAV2 with liver gene expression/dysregulation, has been demonstrated, and this element, originally present in pSub201-derived rAAV vectors, should be excluded from rAAV vectors for clinical use. 54
BACKGROUND IN VIVO AND IN VITRO CARCINOGENESIS ASSAYS: STATE OF THE ART AND LIMITATIONS
The difficulties encountered with the assessment of potential carcinogenicity of vector-mediated GT in a nonclinical system can be attributed to several factors. Many of the critical events within the multistep process of carcinogenesis that are known to occur in humans are difficult to reconstruct in nonhuman models. There is incomplete understanding of how parameters such as dose and patient- and disease-specific factors influence the outcome, thus obscuring any causal relationships that may exist. Specific benefits and limitations of in vitro and in vivo models are shown in Table 1 and elaborated below.
Advantages/Disadvantages of Current In Vitro Versus In Vivo Genotoxicity Assays
HSC, hematopoietic stem cells; GT, gene therapy
In vivo assays
With the current state of knowledge, animal models have limitations for human carcinogenicity risk assessment in terms of specificity and sensitivity: a negative result does not exclude a human-specific genotoxic mechanism, while a positive result might not be human-relevant. In addition, in vivo studies present animal welfare concerns and they are expensive and time-consuming (especially in terms of long-term follow-up in large animals). For GTs, where in vivo animal studies are performed for general toxicity evaluation, some readouts could be informative in the context of carcinogenicity detection (i.e., detection of preneoplastic lesions, increased proliferation with Ki67 staining, hematology evaluation). The use of animals has the advantage of leveraging the cellular and tissue complexity (e.g., impact on engraftment and clonality of the transduced population, competition in fitness and survival, presence of the immune system), tissue microenvironment, and disease context that may influence carcinogenicity risk. For example, Hernández et al. reported transformation linked to chronic inflammation in CGD animals but not in healthy wild-type animals. 55 Animal models of ex vivo GT (RV and LV vectors) may include xenotransplantation of human cells into immune-deficient mice, serial transplantation, and/or use of tumor-prone mice. 56 –61 Use of tumor-prone mice increases the sensitivity of the assay in some settings, and has been able to predict mechanisms of genotoxicity and demonstrate the impact of vector design on genotoxic potential. 61 The (human) host genetic background (germ line or somatic mutations) is also difficult to adequately model in animals (e.g., potential impact on DNA repair, integration profiles, inflammation, or stress hematopoiesis). In summary, although animal studies have limited value in quantitative risk assessment of specific vectors in particular patient groups, they may be useful in understanding the pathogenesis (e.g., through histological assessment of preneoplastic lesions) of vector-induced tumors and in determining the role of potential risk factors.
In vitro assays
All in vitro assays, regardless of using human or nonhuman cells, have the limitation that they cannot recapitulate the complexity of a living organism. Examples of in vitro assays currently used for genotoxicity assessment of GT products are shown in Table 2.
In Vitro Genotoxicity Assays
In vitro assays using murine cells
Two murine in vitro assays in hematopoietic stem cells—in vitro immortalization (IVIM) and surrogate assays for genotoxicity assessment (SAGA), have been developed with appropriate positive controls to explore the mutagenic potential of certain GT vectors and to qualitatively support human risk assessment. 62 –64 Data generated from IVIM, alone or in combination with SAGA, have been accepted by regulators for LV and RV assessment to support approximately 20 programs. 64 In some cases, the use of the IVIM assay has also influenced the in vivo study design: a new vector, that differed from the parental one for only one element, was tested in vivo for a shorter time once proved to be similar to the parental one in the IVIM assay (personal communication from Dr. Rothe and Prof. Schambach). Currently both SAGA and IVIM assays are typically run together, although the SAGA assay seems to be more informative and robust. 64
Additional in vitro assays have been described to evaluate insertional events in murine cells upon transduction with RV or LV vectors. These include a double-negative 2 (DN2; CD4-CD8− CD44+ CD25+) arrest during T cell differentiation assay and an interleukin 3 (IL-3)-independence assay. 65,66 This group is unaware whether these assays have been used for regulatory purposes.
These in vitro tools are informative of mutagenic risk of GTs utilizing integrating LV and RV vectors and could potentially be used for DNA transposons, but they have not been optimized nor used for evaluation of rAAVs. However, concerns also remain, for example, that mouse surrogates may have limited predictivity where the risk arises from a human sequence and/or is species-specific. It is also unclear if the use of cells from diseased animal models could improve human translation, although unpublished IVIM and SAGA data from specific animal disease models (e.g., recombination-activating 1 (Rag1), Rag2 knockouts) and tumor-prone models (e.g., cyclin-dependent kinase inhibitor 2A (Cdkn2a) knockout) have not demonstrated a superior translation compared with cells from wild-type animals. However, in the case of Rag1 and Rag2 knockout cells, this was due to poor cell growth and viability compared with wild-type cells; whereas, in the case of Cdkn2a knockout cells, excess proliferation of mock-treated cells reduced the sensitivity to any further proliferative effects (personal communication from Dr. Rothe and Prof. Schambach).
In vitro assays using human cells
A variety of human cell-based in vitro assays have also been described (Table 2): hInGeTox assay, Jurkat-based assay, IL-2-independence growth assay, soft agar colony forming assay, growth in low-attachment assay, and long-term adverse treatment effect assay. 67 –71 Soft agar colony forming assay, growth in low-attachment assay, and long-term adverse treatment effect assay have been recently proposed for CRISPR-Cas9-induced transformation evaluation; however, their utility for viral- and transposon-based GT products remains to be assessed. The IL-2-independence growth assay has been used for genetically modified T cells, but its value is debated because of the limited predictivity, linked to the lack of appropriate controls and assay standardization. hInGeTox is a new assay based on induced pluripotent stem cells (iPSC) and iPSC-derived hepatocytes that combines phenotypic and molecular endpoints, however, more published data are required to evaluate its predictivity. 68
BACKGROUND—METHODS OF INTEGRATION SITE ANALYSIS
GTs pose several unique challenges for genetic toxicologists: all negative controls in a standard genotoxicity assay have a biologically defined background signal, arising from basal mutations and chromosomal damage from normal endogenous processes and/or background exogenous sources. Typically, a positive signal is defined as a statistically significant difference over this background. When using integration site analysis (ISA), negative control samples (i.e., nontransduced cells) have a zero value by default for toxicity assessment (apart from any technical artifacts), as even minimal insertion frequencies in treated samples are statistically significant, despite potentially only having limited biological impact in terms of mutagenesis and cancer risk. A more mechanistic approach might be to compare the extent of insertion events with controls having a more relevant “biological background,” such as a clinically proven safe vector or from comparable nononcogenic naturally circulating viruses.
In addition, the integration frequencies and patterns differ between vector types, requiring customized approaches for the interpretation of integration data. For example: Integration frequency is much lower with rAAVs than with RV/LV/DNA transposon-based vectors.
72
However, a recent report suggests that rAAV integration might drive long-lasting transgene expression versus short-lived episomal expression, which, if confirmed, would therefore be beneficial.
73
Integration patterns differ between RV and PiggyBac transposons (promoter regions), LV (gene bodies), and Sleeping Beauty transposons (almost random).
74,75
Current data suggest that the integration pattern for RV/LV/transposon-based vectors shows more host genome sequence bias compared with rAAVs, the latter largely considered random and enriched for fragile DNA sites and loci with high expression levels and decreasing over time after dosing.
46,73,76
However, while it has been speculated that homology-directed repair (HDR) may be contributing to rAAV integration, this appears to be a minor pathway, since even vectors designed for targeted integration achieve HDR at one-tenth of random integration efficiency.
77,78
Furthermore, the transgene is generally cloned as complementary DNA, and codon optimization would further reduce the sequence homology between the vector and host genome and, as a result, HDR integration efficiency. Accordingly, human hydroxymethylbilane synthase (hHMBS) gene was not reported among the common integration sites by multiplex linear-amplification mediated PCR (LAM-PCR) neither in patients nor in nonhuman primates treated with rAAV expressing codon-optimized hHMBS.
79
rAAVs, in particular, frequently show integration of concatemers and/or complete and partial vector genomes.
23,46,47,73,79
DNA transposons can remobilize within the genome (so long as the transposase remains active) as opposed to stable integration seen with viral vectors.
29
ISA has high negative predictive value (i.e., lack of integration suggests safe vectors), but limited positive predictive value (i.e., evidence of integration does not mean unsafe vectors). Current ISA methods typically report hundreds to thousands of integrations per biological specimen; therefore, data interpretation is based on the identity of nearby genes and enhancer annotation should also be considered. It should be noted that a significant proportion of the human genome consists of “cancer-associated” genes when broad literature mining definitions are used. 80 Given the size of these data sets, it is likely that proto-oncogenes (and other cancer-associated genes) will be identified near vector integration sites by chance alone. Therefore, in many cases, the insertional findings should be of negligible concern, unless both their location is biologically relevant and their frequency is high enough, or expanded clones are detected. It is probable that most integration is benign, even if it is a prerequisite for transformation leading to clonal expansion and tumorigenesis. Lastly, the age-related background level of mutations should be considered. For instance, Martincorena showed a high frequency of mutations in cancer driver genes in normal epithelial tissues of healthy adults, whose number increased with age. 81
Rather than focusing on a single time point, it might be more informative to evaluate integration site (IS) profiles at different time points (e.g., drug product (before administration) and after in vivo treatment or, for an in vitro study, at an early and a late time point) to identify any potential trends of clonal enrichment. Statistical analysis should distinguish clusters of integration sites resulting from vector-specific integration preferences from those resulting from a selection process, potentially by combining gene integration frequency and integration site abundance. However, clonality itself does not necessarily reflect a precarcinogenic state. For example, for T cell products, clonal enrichment could be dependent on the presence or absence of antigen presentation: antigen-dependent expansion is likely benign and linked to physiological proliferation (Cbl and Tet2 CAR-T cases), while cases of antigen-independent expansion are either linked to tonic signaling or carcinogenicity, for example, ex vivo culture, or transposon cases. 14,15,82 –84 Similarly, clonal expansions have been observed after SIN LV gene therapy in hematopoietic stem cells (HSCs) associated with integration in HMGA2 in one beta-thalassemia patient and multiple SCID patients. 31,32 In these cases, disrupted splicing appears to have resulted in a truncated messenger RNA, which removed negative regulatory sequences and resulted in cell proliferation.
The biological relevance of clonality could be carefully considered using a combination of transcription profile changes, cell growth dynamics, cancer associations reported in the literature, and human data. To ultimately define the impact of a given integration (e.g., driver vs. passenger role), a targeted integration could be generated, and its functional consequences (including clonal enrichment) assessed, but the feasibility and realistic throughput of such an approach are currently very limited. 40,85
Numerous ISA methods have been developed each with different degrees of sensitivity, specificity, potential sources of bias, and regulatory acceptance (see Table 3 for acronyms and brief description). These methods include LAM-PCR, non-restrictive LAM-PCR (nrLAM-PCR), ligation-mediated PCR (LM-PCR), shearing extension primer tag selection followed by ligation-mediated PCR (S-EPTS/LM-PCR), and TES, among others. 42,47,86 –88 Restriction enzyme- and linear amplification-based methods have largely been replaced by sonication-based and linker-mediated technologies, respectively, due to reduced bias and potentially increased sensitivity. TES relies on DNA hybrid capture instead of PCR amplification, avoiding bias toward inverted terminal repeat (ITR) or LTR integrations. To the best of our knowledge, only the methods above have been used for regulatory submissions. New methods are being continuously developed, such as multiplex LAM-PCR, CreViSeq, ITR-seq, Target-seq, long-read sequencing using Pacific Biosciences or Nanopore technology and INSERT-seq, but to our knowledge they have not been fully validated nor used for regulatory submission. 79,90 –95 Viral integration can also be determined by whole-genome sequencing (WGS) but its sensitivity/specificity is limited and requires high sequencing depth and coverage and should be only considered for clonal cell populations. 47 A recent article compared TES, S-EPTS/LM-PCR, and WGS for AAV integration, showing similar sensitivity of TES and S-EPTS/LM-PCR with a lower sensitivity for WGS. 47
Integration Site Analysis Methods
As for any assay validation, use of positive and negative controls is important to demonstrate assay performance and support data interpretation. Examples of technical positive control materials include cell line(s) or isolated clones with known integrations, vectors with sequences that are known to have active contributory elements that drive oncogenesis, while clinical comparators would be any vectors linked previously to clinical malignancies. 88,95,96 For applications involving detection of low-frequency integrations, careful analytical validations entailing positive controls spiked into negative controls at different relative frequencies should be used to rigorously to demonstrate limit-of-detection. Currently, this has been a challenge for evaluating AAV-based vectors, as no clinical malignancy has been identified with rAAV GT.
CONSENSUS STATEMENTS
Based on the current state of the science and agreed among all experts, the group proposes the following
In vivo and in vitro assays
The existing regulatory approval process for GTs should continue to develop to ensure detection of potential causes of carcinogenicity that are relevant to humans. Areas of uncertainty in current nonclinical models will require considerable research efforts in the future. However, these efforts to improve on existing suboptimal/incomplete nonclinical assays should not delay the development of novel therapies.
Current nonclinical assays should be evaluated with clinically relevant products, ideally back-translating real-world examples and confirming negative and positive predictivity.
For all assays, either based on human or nonhuman cells, the low throughput and lack of broad access are limitations that need to be addressed.
Appropriate negative and positive controls, protocol standardization, and assay validation are required for wider acceptance and to support routine regulatory decision-making. Because of the current knowledge gaps and the ever-evolving technologies for the assessment of the carcinogenicity potential of GTs, when a validated assay is not available or validation is not achievable, the use of exploratory, in-development, research-grade, noncentralized assays should be considered by regulatory decision makers, provided they are scientifically sound.
Whenever possible, the focus should be on the “3Rs” principles of reducing animal usage, replacing them by alternative human-relevant nonclinical systems, and, where that is not possible, using the information from in vitro assays and in silico tools to refine them to improve animal welfare.
Integration site analysis
Alongside technical controls (untransduced cells, nonintegrating vectors), we suggest different approaches to define clinically negative controls for ISA, according to the vector type: For any vector types (LV, γRV, rAAV, Sleeping Beauty, and piggyBac transposons), a comparison with similar vectors proven to be clinically safe for ≥15 years (or as long as feasible) and ideally in the same target cell type. For LV, comparing with naturally occurring HIV infections in matching cell types (e.g., T cell or macrophages).
97
For γRV, comparing with murine leukemia virus infections in dividing B or T cells or similar cell lines could be explored, but species differences (mouse vs. human) limit this approach.
98
While wtAAVs are unlikely to be carcinogenic, their value as negative controls for comparison with rAAV clinical data is less clear as wtAAVs express viral proteins, including Rep protein, which is responsible for integration in vitro.
99
However, the frequency of Rep-driven AAVS1 integration hotspot identified in vitro for wtAAVs is variable across cell types and has not been confirmed consistently in human tissues, while other integration sites have also been identified.
46,52,53,93
Since rAAVs lack Rep protein, and although the relevance of Rep on wtAAV clinical integration might be limited, the value of any comparison remains debatable and should be justified case by case. Furthermore, for rAAVs, it is unclear how in vitro and clinical integration patterns correlate, given the discrepancy observed for wtAAVs. Nevertheless, clinically proven safe vectors or technical controls remain suitable comparators. For transposons, there are no naturally occurring integrations from currently used Sleeping Beauty and piggyBac. There are DNA transposons and inverted repeats throughout the human genome, including PGBD5 and RAG1/2 which may be involved in the development of some tumors.
100,101
The sequences of the RAG1, RAG2, and PGBD5 inverted repeats differ from those in Sleeping Beauty and piggyBac transposons and there is no evidence of cross reactivity. However, any newly developed transposon systems should be assessed for functional and ISA cross-reactivity with endogenous DNA transposons.
Approaches to risk assessment
All vectors have an integration risk; however, the frequency of integration events, as well as the integration pattern, varies between vectors of the same class or among different types of viral vectors.
As a general rule, the risk assessment package should be designed case-by-case taking into account several factors, including but not limited to type of vector, vector design, target disease/tissue and patient population, vector dose, route of administration, and tissue distribution. It should be borne in mind that depending on the vector serotype, dose, and route of administration, tissue distribution beyond the intended (therapeutic) target tissue could occur and this should be considered when selecting tissues for ISA and for the overall risk assessment.
For many current GTs, often targeting severe diseases with limited treatment options, the risk of carcinogenesis is clearly outweighed by the therapeutic benefit (e.g., FDA and EMA approvals for eli-cel/Skysona even with 3/64 myelodysplastic syndrome (MDS) cases and 11/64 oligoclonality cases. 12
When the field progresses into less severe diseases or those with existing treatment options, the risk/benefit analysis will require a more “quantitative” understanding of the risk (for instance, by classifying GT products into broad “risk categories”).
Despite the inherent mutagenic potential of integrating vectors, only a limited number of integrations have been associated with tumor formation. The binary presence/absence of integration is insufficient to predict the likelihood and risk of eventual carcinogenesis in any specific context. Therefore, lack of integration has a high negative predictive value, but a positive result requires further evaluation.
Several factors plausibly influence the risk for carcinogenesis following one or more genomic integrations in a target cell. These include the following: nature of the target cell (lineage, state of differentiation, epigenetic status, previous natural viral infections), vector type, vector design (e.g., tropism, nature of transgene and promoter/enhancer (strength, transactivation level), presence of LTRs, SIN design, splice sites), dose of vector, route of administration, disease background, individual patient factors (age, preexisting conditions, genetic predisposition, comedications, lifestyle factors), and proliferation rate in the target cell population following transduction. It is also important to consider that cell types vary in the number of mutations required to drive cancer formation and in the latency to cancer formation. 102 A useful review of risk factors in the context of HSC GT has recently been published. 103
Nonclinical carcinogenicity risk assessment for a novel GT product based on any vector (given ex vivo or in vivo) with the ability to integrate into host DNA (by design or incidentally) would benefit from a standardized “weight-of-evidence” approach, as shown in Table 4. This approach could be applicable as the field progresses to incorporate other viral vectors. 72
Risk factors considered to increase risk (of carcinogenesis) should be weighed against those that mitigate against. This can be further supported using product-specific assays assessing biologically relevant endpoints where available (Supplementary Table S1). Given the divergent limitations of both in vitro and in vivo approaches, a tiered approach starting with in vitro assays followed by appropriate and 3Rs-driven in vivo experiments should be adopted as a standard approach.
Any product-specific assays used should ideally possess the characteristics described in Supplementary Table S1. While this group is not prescribing specific assays to be used for any particular product, it is hoped that the use of a standardized overall approach would allow meaningful comparisons (of risk) within specific groups of GT products. It is recognized that many of the items in Supplementary Table S1 are aspirational/forward-looking, with the aim of producing the most human-relevant/predictive assays.
As knowledge and experience increase, a more deterministic approach, such as an “adverse outcomes pathway (AOP)” analysis, could be used to illustrate potential pathogenesis of GT-induced carcinogenesis. 104 Initially, AOP analysis on existing (non-clinical and clinical) cases of GT-induced carcinogenesis could be used to graphically demonstrate possible pathway(s) from a “molecular initiating event” (in this case integration of vector into host DNA), through a series of “key events,” to a potential “adverse outcome” (AO; e.g., lymphoma following X-SCID or WAS GT; liver tumors following GT of neonatal mice). Further details on the AOP approach (with an example of a regulatory approved (non-GT) AOP) are given in Supplementary Data S1, along with a hypothetical AOP framework for GT-driven carcinogenesis, and how this might be used prospectively.
Generic Risk Assessment for a Novel (Integrating) GT Product
Each parameter should be assessed qualitatively (and where possible quantitatively) to produce a weight-of-evidence- based risk assessment.
Includes cells unintentionally transduced by GT vector due to unavoidable biodistribution.
Future developments/Research needs
The areas identified by the group for future development and focus were as follows:
The vectors that have caused the leukemia cases observed in the clinic with X-SCID, WAS, and ADA-SCID are relevant positive controls. Their insertional mutagenesis mechanism is reasonably well known, even if the exact contributions of some critical product attributes and potential individual risk factors to the eventual carcinogenic outcome are not fully understood.
As data and experience are gained through more standardized approaches, it may become possible to set ranges for certain parameters (such as vector copy number (VCN), multiplicity of infection, or vector dose). For instance, for ex vivo GT, determining the average VCN per transduced cell is considered a more accurate determinant of product risk. 4,105
More effective risk assessment of GT could be achieved by closing certain knowledge gaps. The following, noncomprehensive, list includes current knowledge gaps in the GT field identified by the authors, which should be subject to further research.
Integration site analysis: Insufficient sensitivity of ISA methods to detect very low numbers of genomic integrations in a cell population. Difficulty in the detection of partial (e.g., without LTR/ITR sequence) or rearranged integrations by most methods. Unknown thresholds for safe integration in/near known oncogenes, whether previously associated with vector-driven transformation (e.g., LMO2) or not. Lack of harmonization/cross-validation of ISA methods/platforms, and reporting, to facilitate historical comparisons across vectors/companies. Cancer-associated gene lists are currently inconsistent across different laboratories and are based on varying criteria used to define an “oncogene.” These lists may also vary between species. Different vector types show distinct genomic integration patterns (promoters vs. gene regions vs. semirandom, often dependent on, e.g., GC content, epigenetic marks); however, correlation of these differences with safety across different vector types is only partially understood. The impact of random integration versus targeted integration (integration “hotspots”) and quantitative risk is not well defined.
Clonal tracking: Lack of agreement on a threshold that defines clonal outgrowth and on the cell population to be used as denominator (e.g., total white blood cells vs. individual blood cell lineages). Poor understanding of the relationship between (oligo-/mono-) clonality arising within a population of transduced cells and potential neoplastic transformation. Limited understanding of the relative differential risk that should be attributed to different mutations that define these clones. Interpretation of clonal dominance when confounded by occurrence of multiple integrations within the same cell.
106
Difficulty of clonal tracking over time in solid tissues in the clinic, although approaches to this issue with cell-free DNA are in development.
107
Epigenetics/transcriptomics: Role of epigenetic status on integration of specific GT vectors. Impact/predictivity of GT vector-induced epigenetic and transcriptomic changes on/for risk of neoplasia.
Statistical methods: Interpretation of ever-expanding data sets. Lack of statistical/mathematical approaches to model dynamics of potentially adverse clonal outgrowth in vivo, for example, by combining gene integration frequency and IS abundance.
In vitro assays: Current in vitro models limited by short-term follow-up and restricted ability to simulate competitive selection/clonal outgrowth. Lack of qualified/validated assays for rAAVs. Lack of availability of positive controls for some vectors (e.g., rAAV) and/or some cell types. Validation, standardization, availability of assays in general.
Dose relationships: For rAAVs in vivo, the relationship (linear, with threshold) between VCN in a sample and integration frequency. For any cell population exposed to a fixed number of integrations (fixed bulk VCN), there is poor understanding of the relative risk between integrations spread across all/the majority of the cells (high transduction efficiency) or integrations occurring in a small proportion of cells (low transduction efficiency). Allowing for differing frequencies of integration (of differing vector types), the relationship between the total number of cells carrying integrations and risk is unclear.
Target tissues: The sensitivity/resistance of different target (intended or unintended) tissues to neoplastic transformation following integration of GT vectors. Absolute resistance of differentiated cells to neoplastic transformation. The quantitative relationship between rate of cell turnover (in any given tissue) and sensitivity to neoplastic transformation.
108
Patient factors:
Nonintegrating GT vectors: Potential risks associated with nonintegrating GT vectors carrying strong promoters, regulatory elements, or transgenes with growth-promoting effects.
Basic biology of rAAV vector–cell interactions: Comparison of wtAAV with rAAV. rAAV integration patterns in animals compared with human. The authors propose that a publicly available database containing rAAV integrations in nonclinical species, and from clinical settings must be set up. This must be associated with a minimally agreed set of metadata (genome coordinates and some annotation (e.g., nearby genes)) to maximize the value of these data for others in the field and characteristics of the associated rAAV. Understanding the molecular mechanisms of rAAV-induced liver tumors in mice, from Rian locus integration to tumor formation. Understanding rAAV integration frequency and pattern in nonhepatic tissue and associated risks.
Concerted research efforts (e.g., multisite studies via public consortia) are recommended to characterize/qualify assay performance, ideally leading to validation. Some human-relevant alternative assays, models, and tools for the study of potential carcinogenicity already exist, but many of these have not been taken forward into an assay validation process. A recently initiated multisite study led by HESI CT-TRACS to evaluate the IL-2-independence assay for T cell transformation is a good model on which such initiatives could be based.
In addition, suitable assays are still missing for some mechanistic aspects of carcinogenicity and dedicated research projects for assay development are needed to fill these gaps. The development of human-relevant novel in vitro assays to assess the tumorigenicity of genetically modified T cells (CRACK-IT challenge T-Alert) and genetically modified HSCs (CRACK-IT Challenge Clean Cut) is a current example of industry/academic collaborations, both led by the NC3Rs, working to develop new assays in this area. A wider testing of novel assays is also important to ensure their fitness for use, and initiatives such as the HESI CT-TRACS, which will perform a multisite study on the T-Alert Challenge assay(s), are highly warranted.
SUMMARY
This article outlines a set of high-level principles that reflect the current state of the art and expert knowledge, to guide and inform the assessment of potential vector-mediated genotoxicity/potential carcinogenicity of the GT products described. The GT field is rapidly evolving, and it is expected that as progress is made, new knowledge and understanding can be integrated into these principles. Data transparency will be essential, and the authors specifically propose that data generated from viral IS studies in nonclinical species, and from clinical settings, are made publicly accessible (through, e.g., databases and patient registries). There is a clear need to develop improved nonclinical assays and the potential for nonanimal, more human-relevant approaches will deliver significant advances in our understanding, even if there will remain in the near-term, a need to use animal models.
Key to realizing these advances is collaboration and engagement across the sectors, capitalizing on opportunities to build on successful academic and industry collaborations through funding initiatives such as the NC3Rs CRACK IT program and the HESI technical committees.
The increasing role of GT in medicine mandates a better understanding of vector-mediated safety concerns to allow the full potential of GT to be exploited.
Footnotes
ACKNOWLEDGMENTS
The Steering Group would like to thank Dr Sandhya Sanduja, Food and Drug Administration (FDA), Dr James McBlane, Medicines and Healthcare Products Regulatory Agency (MHRA), Dr Kazushige Maki, Pharmaceutical and Medical Devices Agency (PMDA), and Tineke van den Hoorn, Dutch Medicine Evaluation Board (CBG-MEB), for their participation and insightful contribution to the scientific discussions.
AUTHOR CONTRIBUTIONS
S.J. conceived the original idea for this work. J.C.K., A.D.R.E., S.L., S.J., P.C., M.D.F., A.M.L., H.J.M., and A.Z. wrote the first draft of the article. I.K. chaired the face-to-face meeting. All authors contributed to the meeting and to commenting on the first draft of the article. Following the meeting, J.C.K., A.D.R.E., S.L., P.C., M.D.F., A.M.L., H.J.M., C.V., and A.Z. wrote the second draft of the article. All authors reviewed and commented on the second draft of the article.
DISCLOSURE STATEMENT
No money was paid for the preparation of this article. During preparation of this article, the salaries of contributors were paid by their respective employer. None of the content of the article contains any information that could be patentable or claimed as intellectual property of the contributors or their respective companies. In addition, the following declarations were made by individual contributors.
J.C.K. is founder of Cornelis Consulting Ltd. and serves as a consultant to Apollo Therapeutics, AviadoBio, Orchard Therapeutics, and Sangamo Therapeutics; he is an ex-employee and equity stake holder of GSK and is on the scientific advisory board of Testavec. He received no financial reward for his participation in this work. A.D.R.E. is a full-time employee of GentiBio, Inc. and holds shares from Novartis Pharma AG. H.J.M. and S.L. are full-time employees and own stocks of Novartis Pharma AG. A.M.L. is an equity stake holder of GSK. C.V. declares no competing interests. A.Z. owns stock of F. Hoffmann-LaRoche Ltd. L.B. works for and owns shares of Nvelop Therapeutics. M.H.B. is employed by cellvie AG, Zürich, Switzerland, and a shareholder of cellvie AG. R.B. declares no competing interests. T.C. serves as an advisor to Aavigen, Cimeio Therapeutics, Excision BioTherapeutics, GenCC, and Novo Nordisk, has a sponsored research collaboration with Cellectis, and holds several patents in the field of genome editing. H.C.J.E. is a cofounder of Virion Therapeutics, Inc. She as a consultant to several Gene Therapy companies. R.G. a full-time employee of Protagene CGT GmbH, a CRO providing services such as integration site analysis to assess the biosafety of gene therapy vectors. G.G. is a scientific cofounder of Voyager Therapeutics, Adrenas Therapeutics, and Aspa Therapeutics, and holds equity in these companies. G.G. is an inventor on patents with potential royalties licensed to Voyager Therapeutics, Aspa Therapeutics, and other biopharmaceutical companies. J.J.L. provides consultancy services for BlueWhale Bio. I.K. is a shareholder of Astra Zeneca. T.A.L. is an employee of Pfizer. B.L.L. is on the following Scientific Advisory Boards: Avectas, Capstan (Chair), Immuneel, Immusoft, In8bio, Ori Biotech, Oxford Biomedica, Thermo Fisher Pharma Services, UTC Therapeutics, Vycellix. Scientific Advisor: Kite. Past: Akron. B.L.L. is cofounder and equity holder: Tmunity Therapeutics (acquired by Kite), Capstan Therapeutics. B.L.L. Conflict of interest is managed in accordance with the University of Pennsylvania policy and oversight. M.O. declares no competing interests. D.M.P. works for Merck & Co., Inc., Boston, MA, USA, and owns shares of Merck & Co., Inc. and AVROBIO, Inc. S.R. is cofounder and CSO of Broken String Biosciences Ltd. D.E.S. is a consultant for Poseida Therapeutics and Biomarin Pharmaceuticals and receives licensing royalties from Spark Therapeutics. J.S. is a founder and equity holder in TwinStrand Biosciences Inc. M.R. and A.S. support academic and industrial partners with work on preclinical assays IVIM and SAGA on a collaborative or fee-for-service basis. M.R. and A.S. hold a patent on the analytical process of SAGA (WO2017108598A1). M.T. is a cofounder and Director of TestAVec Ltd., spun-out from Brunel University, and holds a patent concerning hInGetox “Method for Testing a Gene Therapy Vector” IP number PCT/GB2018/051937 with several collaborative projects in safety for gene therapy.
FUNDING INFORMATION
This work did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors. The U.K. NC3Rs supported attendance costs for the EU and U.K.-based academic attendees and UKEMS provided bursaries for a number of the workshop attendees. NC3Rs hosted the workshop in London, March 2023.
SUPPLEMENTARY MATERIAL
Supplementary Data S1
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
