Abstract
LIST OF ABBREVIATIONS
Huntingtin
polyglutamine
Huntington’s Disease
N-terminal Htt fragments up to and including amino acid x
polyglutamine sequence containing n glutamines
Cytomegalus virus
Green Fluorescent Protein
mixed category of inclusions
Fetal Calf Serum
2-(4-amidinophenyl)-1H -indole-6
Phosphate buffered saline
BACKGROUND
CAG-trinucleotide expansion mutations, which encodes a polyglutamine stretch within exon 1 of the HTT gene beyond 36 repeats causes Huntington’s disease (HD) [1]. A key feature of HD pathology are intracellular inclusion bodies comprising N-terminal fragments of mutant Huntingtin (Htt) that encompass the polyQ segment [2]. The exogenous expression of exon 1 in polyQ-expanded form is sufficient to recapitulate neurological defects and aggregation pathology resembling some features of HD in rodent and primate models [3–5]. Because of this, the Htt exon 1 fragment, which comprises 90 amino acids 1 , has been extensively used to model the molecular mechanisms of HD.
The fragmentation process of Htt, and possibly direct production of truncated aberrantly spliced translation fragments, seems to be critical to mediating pathology [6–8]. Htt forms an array of different size cleavage products, putatively by caspase, calpain and other protease activity that precede pathogenesis [9, 10]. The transgenic expression of the shorter Htt fragments generally confers greater toxicity than longer lengths, as demonstrated recently in a systematic comparative study in Drosophila [11] as well as in mouse models where shorter fragments typically invoke a more severe phenotype [12]. Mouse models that suppress generation of the N-terminal cleavage products by mutation of putative caspase proteolytic cleavage sites in Htt also lead to delayedpathologies [13].
Some of the key fragmentation steps of Htt or commonly used fragment models are summarized as follows. Caspase family (and possibly other [14]) protease cleavage products have been suggested to occur at aspartate residues 513, 552, and 586 [13, 15–17]. Calpain cleavage products have also been shown to occur cleaving at residues 469 and 536 and appear important for mediating aggregation and toxicity in cell culture [7, 17]. Fragments from human brain inclusions, designated as cp-A and cp-B are of between 104–114 and 146–216 respectively [18–21]. Mouse models expressing the 171 fragment have been made that like Htt exon 1 can model HD pathogenesis [22].
During our studies on different proteolytic lengths we found that the aggregation pattern in cultured mammalian cells was not necessarily related to the length of polyQ. Given the importance of aggregation as a marker of pathology and its sometimes controversial role in relating to toxicity [23], we sought to systematic define the relationship between fragment length and polyglutamine length on aggregation patterns. Our findings described here indicate aggregation into visible inclusions is a complex pleiotropic-like phenomenon that in some contexts is poorly related to polyQ length. These results highlight the importance of understanding the molecular bases of aggregation phenomena observed morphologically in cellular contexts.
MATERIALS AND METHODS
Plasmid vector construction
The cDNA of full length human Htt (in 25Q form) was amplified into the fragment lengths shown in Fig. 1 by PCR; and then cloned into pEGFP-N1 via the SmaI and BamHI sites. The resultant constructs expressed EGFP as C-terminal fusions to the Htt fragments. The 97Q forms were made by replacing the 25Q cDNA with the corresponding cDNA sequence from a previously described pTIREX plasmid encoding human Htt exon 1 with 97Q [24] via internal restriction sites (EcoN1 and SacII). All constructs were sequenced for verification. N90 plasmids fused to mCherry were prepared in the pGW1 vector as previously described, and the vectors for this were kindly provided by Steven Finkbeiner [25].
Cell culture
African green monkey kidney fibroblast-like cell line COS1 and human epithelial kidney cell line AD293 were cultured in complete DMEM (Invitrogen) media supplemented with 10% (v/v) fetal calf serum (FCS), 1 mM L-glutamine, and 200 μg/ml penicillin/streptomycin. Mouse neuroblastoma cell line Neuro-2a was cultured in complete OptiMEM (Life Technologies)) media supplemented with 10% (v/v) FCS, 1 mM L-glutamine and 100 μg/ml penicillin/streptomycin. Cells were cultured at 37°C in a humidified incubator with 5% atmospheric CO2. Cells were passaged every two days, with COS1 and Neuro2a passaged at 1/4 dilution and AD293 at 1/8 dilution.
Transfections
For flow cytometry, 1.4×105 COS1 cells were plated per well on 24-well plates. The next day cells were transfected with Lipofectamine 2000 (Invitrogen) reagent containing 640 ng Htt fragment-EGFP vector and 160 ng mCherry vector. For confocal microscopy, 9×104 Neuro2a cells or 7×104 COS1 or AD293 cells were plated per well on 8-well μ-slides (Ibidi). For the co-transfection experiment of N90 with N586, COS1 cells were transfected with 100 ng N90 (97Q) fused to mCherry and 400 ng N586 constructs fused to EGFP using Lipofectamine 2000 reagent.
Confocal imaging
48 hours after transfection cells were stained with 0.1 μg/mL 2-(4-amidinophenyl)-1H -indole-6-carboxamidine (DAPI) in phosphate buffered saline (PBS) for 15 mins. Then cells were fixed with 4% (v/v) paraformaldehyde in PBS for 15 min. Cells were imaged on a Leica SP5 confocal microscope using a 40× or 63× objective lens. The DAPI channel was collected with an excitation wavelength of 405 nm and emission wavelengths of 445–500 nm; EGFP was collected by excitation at 488 nm and emission from 520–570 nm; and mCherry was collected by excitation of 561 nm and emission range of 600–700 nm. Tiled images were taken by using 40× objective lens.
Image analysis
All microscopic images were processed by Fiji (ImageJ) software [26, 27]. For the quantitation of inclusion types images of cells co-transfected with the different Htt fragment-EGFP fusions and mCherry and then stained with DAPI were analyzed by software Cellprofiler [28] on microscopic images of cells. Cells were automatically detected and traced using a nuclear staining mask (DAPI) and a whole stain mask (mCherry). The settings were calibrated manually by visual inspection for fidelity of object recognition for each experiment (this was not done by a blinded observer). mCherry fluorescence was recorded in each cell and each cell was manually categorized for aggregate phenotype.
Immunofluorescence staining
COS1 cells were transfected with the human Htt fragment constructs and fixed and processed for immunostaining at 48 h post-transfection. As a control for ubiquitin staining of cellular aggregates, COS1 cells were treated with 1 μM MG132 or equivalent DMSO amount (0.2% v/v final) for 16 h. For ubiquitin staining, cells were fixed in methanol for 3 min at –20°C then blocked with 10% v/v normal goat serum (Thermo Fisher Scientific). For cathepsin D staining, cells were fixed in 4% (v/v) paraformaldehyde in PBS for 15 min at room temperature, permeabilized with 0.2% v/v Triton X-100 for 10 min and then blocked with 10% v/v normal donkey serum (Jackson Immunoresearch). Fixed cells were incubated with mouse anti-ubiquitin (Millipore, mab1510) or goat anti-cathepsin D (Santa Cruz, sc6494) antibodies overnight at 4°C. Secondary antibodies, donkey anti-goat (Thermo Fischer Scientific) or goat anti-mouse (Jackson Immunoresearch), were conjugated to Alexa Fluor 647. Nuclei were counterstained with 1 μg/ml Hoechst 33342 (Thermo Fisher Scientific) in PBS. Stained cells were imaged on an IN Cell Analyzer 6000 (GE Healthcare) at 40× magnification using the appropriate excitation/emission channels (405 nm/455 nm, 488 nm/525 nm, 644 nm/706 nm).
Western blot
2.8×105 COS1 cells were plated per well on 12-well plates. The next day cells were transfected with Lipofectamine 2000 (Invitrogen) reagent containing 1.6 μg Htt fragment-EGFP vector. 48 hours after transfection cells were detached and lysed as described previously [29]. Htt fragments-EGFP were dectected using a 1:10000 dilution of an antibody to GFP (A6455: Life Technologies).
Statistics
For evaluation of differences in means (and S.E.M.) the Kruskal Wallis non-parametric 1 way ANOVA with post-hoc Dunns test was used for samples comparisons using Prism (Graphpad) software packages. Differences with P < 0.05 were considered significant (*), P < 0.01 were displayed as (**) and P < 0.001 were displayed as (***).
RESULTS
We first constructed an array of eight N-terminal fragment constructs that match some of the more commonly studied fragments fused at the N-terminus of EGFP (Fig. 1A). This included lengths for exon 1, which has 90 amino acids using the 23Q wild-type polyQ length nomenclature (N90), and N105 2 , N117, N171, N513, N536, N552, and N586. Each construct was made with a 25Q length and a corresponding 97Q length (for a total of 16 constructs). All constructs expressed products of the anticipated mass by Western Blot (Fig. 1B).
As anticipated, all fragments with a HD-associated polyQ length (97Q) formed visible cytoplasmic aggregates in cells when expressed transiently from a CMV promoter (Fig. 2A). Unexpectedly, the longer fragments (above 117 amino acids) with a wild-type polyQ length (25Q) expressed under the same conditions also formed visible foci regardless of the polyQ length, and did so to a similar extent as the 97Q counterpart (Fig. 2A). All constructs had cells with cytoplasmic inclusions except those that expressed N90, N105 and N117 with the 25Q, which did not display any evidence of visible aggregates (Fig. 2A). There was a sharp peak in number of cells with inclusions for 25Q constructs at 171 amino acids. Because aggregation of N90 and other proteins more generally are known to be affected by expression level [30], we measured the comparative expression levels of the fragments by flow cytometry by virtue of the EGFP tag. The expression levels were higher for N90-N105 with 25Q compared to 97Q, whereas expression levels of larger constructs were similar among all constructs regardless of Q length (Fig. 2B). One possibility that might exaggerate the appearance of aggregation in cells with the 25Q forms of the fragments is if the 97Q form leads to toxicity and therefore resulted in a selective loss of highest-expressing cells. Hence, to test for equivalent gene dosage, a control mCherry construct co-transfected with the Htt fragments. For most fragments there were similar levels of expression, and for the longer fragments (N513-N586), there was more expression of mCherry for the longer 97Q forms, which discounted this possibility being a major factor (Fig. 2C). These data hence indicated that the aggregation observed for longer Htt fragments (N171 and longer) was essentially independent of the polyQ expansion (Fig. 2B). Another note from this data is that the decreased expression levels of longer Htt fragments (N513-N586) may account for the lower occurrence of cells with aggregates in these populations as the length exceeds N171 (Fig. 2A).
To investigate the non-polyQ mediated aggregation process in more detail, we examined the morphology of the aggregates by confocal microscopy. Across all of the fragment lengths we noticed four morphological categories of visible aggregates (Fig. 3A). These four categories, Types A–C and a mixed (M) group of the other inclusion types were observed in all three different cell types (COS-1, and AD293 and Neuro2a, Fig. 3B) suggesting the three aggregation processes are governed through common mechanisms in different cells. No inclusions were seen in any cell type with fragments less than and including N117 in the 25Q form. Also, N90, N105 and N117 containing 97Q formed only one type of inclusion in all the cell types, which we designated as “Type A” inclusion (Fig. 4). Type A inclusions were characterized by a symmetrical shape, relatively large size (typically >10 μm in diameter) and typical appearance as a singular entity in cells (Fig. 3A). Type A inclusions also correlated to a depletion of “soluble” Htt protein from the rest of the cell (Fig. 5). Type A inclusions only appeared in cells with the pathogenic polyQ expansion (Fig. 5) so their formation seems to be mechanistically driven by the polyQ expansion. Type B aggregates, like Type A were large in size (more than several μm in width) but had a more asymmetric and extended shape with a more variable location in the cytoplasm (Fig. 3A). Cells with type B inclusions appear to retain soluble Htt that had not been sequestered into the inclusion (Fig. 5). Type B aggregates occurred more frequently in cells expressing wild-type polyQ variants and were present in all fragment lengths including and beyond N513. Type C aggregates occurred as an array of multiple small puncta (less than 8 μm in width) scattered throughout the cytoplasm (Fig. 3A). The notable feature of this aggregation pattern was the dispersed foci through the cytoplasm (Fig. 5), which are reminiscent of stress granules and/or autophagic vacuoles. Type C aggregates were formed by all Htt lengths including and beyond N171, and were in general more abundant in cells with normal polyQ length (Fig. 4). The fourth category, M appeared to include Type C inclusions concurrently with Type A or B inclusions. The M type occurred in both expanded and unexpanded Q-lengths of fragments longer than and including N536 (Fig. 4).
The dispersed pattern of the Type B and C aggregates appeared similar to previously characterized membrane associated (full length) Htt in autophagic vacuoles [31, 32]. These vacuoles-Htt structures were previously found to generally lack immunoreactivity to ubiquitin, which is a marker of the inclusions observed in vivo [2, 3] and were positive for cathepsin B, which is a marker of lysosomes [31]. Neither Type A, B and C showed immunoreactivity consistent with these prior studies, suggesting the aggregates formed from the fragments are distinct and probably protein-based rather than lipid-associated or autophagic in nature (Fig. 6A-C). Specifically, immunostaining of ubiquitin showed Type A inclusions to colozalize peripherally with ubiquitin (Fig. 6A). Type B and C inclusions showed a mild colocalization of ubiquitin staining within the inclusions (Fig. 6A). Apart from the rare exception, inclusions for all fragment lengths and polyQ lengths lacked colocalization with cathepsin D above background levels and mostly appeared to exclude the cathepsinD (Fig. 6C).
We next assessed whether aggregate type was dependent on expression levels of the Htt fragments. Because the expression level of short fragments of less than (and including) N171 was at least 4-fold higher than the longer fragments we analyzed these fragments as two separate groups (Shorter fragment set shown inFig. 7Aand the longer set inFig. 7B). We calibrated the expression levels based on the co-transfected mCherry as a tracer for gene dosage to avoid complications of non-linearity of the EGFP tag on the Htt fragments as they aggregate. For all fragment lengths cells with aggregates were skewed to higher expression levels, which is consistent with an expression level dependence on aggregation. However, for Type B and C inclusions there was no difference in expression level dependence between polyQ lengths for each fragment (except N171; the implications of this exception are discussed further in the discussion), further supporting the independence of these aggregate structures from polyQ-mediated effects. The expression level dependence on Type A was far greater than the other inclusion types, consistent with a polyQ-mediated aggregation mechanism involving amyloid kinetics.
Because these results suggested Type A inclusions are formed via the expanded polyQ sequence and the others are not, we sought to test whether amyloid kinetics of polyQ could redirect the process of aggregation away from the other inclusion types. For this experiment we co-expressed N90 (in 97Q form) fused to mCherry with N586 in either 25Q or 97Q forms (as EGFP fusions). The N90 inclusions poorly overlapped with the N586 25Q inclusions consistent with them forming through distinct assembly mechanisms (Fig. 8). However, the N586 in 97Q form colocalized extensively around the Type A N90 inclusions. Furthermore, these cells displayed no Type B or C inclusions of N586, which suggested that the polyQ-mediated aggregation kinetically drove aggregation away from the other molecular processes of aggregation upon a template of N90 amyloid structure.
DISCUSSION
In this study, we described the aggregation process of a range of fragments of Htt that have been used commonly as models of HD pathogenesis. The results have a number of important implications (discussed below in more detail). But before discussion of those points, we stress that we do not imply that these aggregation patterns necessarily reflect the biological mechanisms involving endogenous full length Htt and its proteolysis. Rather we see our results as highlighting the challenges and caveats of using ectopically expressed fragments of Htt to model HD in terms of understanding the relationship between protein aggregation and pathogenesis.
The first key point is that the fragments have inherent information that drives quite distinct patterns of aggregation into visible puncta. Fragments in length up to N117 formed visible aggregates (Type A) strictly in accord with the polyQ expansion in three cell culture models tested (AD293, COS-1 and Neuro2a). However, surprisingly for fragments of N171 and beyond the extent of aggregation was not overly influenced by polyQ expansion suggesting that these longer fragments contain inherently aggregation prone sequences outside the polyQ motif. The extent of aggregation driven by this non-polyQ sequence (at 10% of the transfected cell population) was not trivial in that it was similar to the extent of aggregation of other aggregating proteins involved in disease expressed under similar conditions (including superoxide dismutase 1 and polyalanine) [33, 34].
As a point of contrast previous studies of N171 containing a short polyQ sequence showed a notably lower level of aggregation than in our study [35, 36]. The differences may arise from differences in tagging design and expression vectors. For example N171 expressed with a myc tag rather than GFP (as we had done) revealed a small fraction of cells with visible aggregates (∼6%) for the 25Q length [36]. Also, that study utilized the pcDNA3.1 vector, which expresses several fold lower levels of proteins than the pTIREX vector used in our study even though both drive expression from a CMV promoter (unpublished observations). Kegel et al expressed Htt fused at the N-terminus with a FLAG tag [35]. This study did not report quantitative results on whether this construct aggregated with a short polyQ length (only one cell showing diffuse staining). Hence, it appears that different tags and expression regimes may affect the susceptibility of Htt fragments to aggregate via non-polyQ mediated mechanisms.
The mechanism of aggregation of the longer fragments (i.e. Type B and C aggregates) appears to be driven by a non-amyloid mechanism akin to what we previously described for polyalanine and superoxide dismutase 1 [33, 34]. Expanded polyalanine has a strong tendency to assemble into α-helical clusters which we previously postulated would cause a derangement of normal oligomeric state as a mechanism of dysfunction [34]. Our work here points to a conceptually similar mechanism (but not necessarily involving α-helices) to describe the behavior of the longer N-terminal Htt fragments. This is an important conclusion because it suggests that such aggregates will confer different impacts to cell health than that ascribed to the “gain of toxic” properties for amyloid structures (including amyloidogenic oligomers) [37–39]. This is of most relevance to the study of aggregation in HD models expressing longer proteolytic sequences ectopically, whereby non-polyQ mediated aggregation mechanisms are predicted to dominate the phenotype.
The differential appearance of Type B and C aggregates upon increasing fragment length suggests two different domains drive clustering into these inclusion categories in a competitive manner to the polyQ-driven Type A inclusion. One lies between residues 117 and 171 to dictate Type C aggregation (Fig. 9). This domain could potentially have a normal biological function in fostering Htt engagement into RNA granules, which have a similar morphology to the Type C inclusions [40]. Prior work has suggested that N590 of Htt (in both wild-type and expanded polyQ lengths) can bind RNA in RNA granule subtype P-bodies and participate in regulating mRNA transport in neuron dendrites, which supports this conclusion [41–43]. The first 170 residues of Htt (but not N90) can also interact with other proteins, including G protein-coupled receptor kinase-interacting protein GIT1, which can modulate the aggregation of polyQ-expanded N170 and which may target it to membranous cellular compartments [44]. The other domain driving Type B inclusions lies between residues 171 and 513. A candidate motif for this clustering are the three HEAT repeats, which operate as protein-protein and protein-nucleic acid interaction domains [45, 46]. Prior work has suggested that the HEAT repeat regions in the Htt sequence position of 172–372 can mediate electrostatic interactions with acidic phospholipids, and hence self-association may also involve interactions withlipids [35].
The formation of three competing clustering strategies has a number of implications. The first is that Type B and C clustering mechanisms likely suppress de novo Type A aggregation mechanisms. In vitro, flanking domains can sterically block Htt N90 aggregation into amyloid fibrils, and hence Type B and C clustering may operate similarly [47, 48] (Fig. 9). However, once the Type A aggregation mechanism is seeded, the amyloid template provides a dominating reaction kinetic that outcompetes Type B and C clustering mechanisms (as we observed in Fig. 8). Without the amyloid template, spontaneous conversion to amyloids occurs at a much lower incidence. Should this mechanism of aggregation be present in the in vivo context, it may influence the spreading progression of Huntington’s disease pathology, and invoke a phenotype similar to that proposed for prion propagation of protein aggregate pathology [49].
Another implication, if these patterns of aggregation do in fact form from natural cleavage products, is that the appearance of aggregates in pathology may reflect the outcomes of three different clustering processes with different impacts on cell health and survival. This is an important concept because while Htt aggregation into inclusions is a key signature of human pathology, it remains unclear as to whether inclusions form as part of a pathogenic process or alternatively as an adaptive response (or both in different circumstances). In brain, some neuron types accumulate inclusions without apparent loss of survival; whereas other types are lost without obvious inclusion formation (reviewed in [50]). Alternative clustering patterns may involve strategies previously described for differential partitioning of misfolded proteins from amyloid proteins into two distinct foci [51]. The focus comprising misfolded proteins, known as JUxta Nuclear Quality control (JUNQ), is postulated to regulate remedial refolding or degradation of misfolded proteins [51]. The other focus, the Insoluble Protein Deposit (IPOD), operates as a depot for terminally aggregated amyloid proteins including polyQ [51]. Our data is consistent with the aggregation of Type A aggregates falling into the IPOD category and Type B and C falling into the JUNQ category. Hence the multiple inclusion categories could reflect intermediate states or a combinatorial clustering mechanism involving protein self-association and cell-mediated clustering. Hence further knowledge of the how visible aggregates are formed mechanistically is required to properly ascertain the role aggregation plays in pathology.
In conclusion this study has unmasked a complex pattern of self-association of N-terminal Htt fragments driven by two different motifs located between residues 117 and 586. These two types of aggregation are independent of the aggregation caused by polyQ expansion and appear to compete (and indeed largely inhibit) classical polyQ-mediated aggregation. We anticipate these findings to have important ramifications in other studies that relate crude aggregation patterns to the pathogenic process. In particular our results demonstrate the need for a more nuanced assessment of the common models for toxicity involving protein misfolding and aggregation.
AUTHOR’S CONTRIBUTIONS
ZC, SAM and ARO designed and performed the experiments and undertook the data analysis, SAM cloned the constructs and prepared the vectors, PJM provided the initial concepts and assisted in preparing the manuscript and DMH oversaw implementation of the experiments, interpretation of the data and co-wrote the manuscript with ZC.
CONFLICT OF INTEREST
The authors have no conflict of interest to report.
Footnotes
Sequence numbering using Htt with 23Q length. This numbering convention is used hereon.
Nx Nomenclature for different Htt N-terminal fragments cleaved after the x residue.
ACKNOWLEDGMENTS
We would like to thank Steven Finkbeiner from the Gladstone Institute of Neurological Disease for provision of the pGW1-based N90 vectors fused to mCherry. This work was funded by grants to DMH from the Australian Research Council Discovery: Future Fellowships FT120100039, National Health and Medical Research Council Project grants APP1049458, APP1049459 and APP1102059 and the Hereditary Disease Foundation. These funding bodies played no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.
