Abstract
Intracellular accumulation of aggregated amyloid-β, misfolded and non-functional proteinopathy, is the hallmark feature in Alzheimer’s disease (AD). There are several mechanisms to clear the amyloid burden in a cell, including transcytosis across the blood-brain barrier, immune mediated, lysosomal pathway associated autophagy, enzymatic degradation by insulin degrading enzyme/neprilysin, and the proteasomal pathway. Among them, the ubiquitin proteasome system (UPS) is playing a critical role to prevent the intracellular amyloid-β deposition and to clear off the cellular burden in association with ubiquitin E3 ligase enzymes in AD. For ubiquitination, lysine moiety in a protein acts like a docking site for the attachment of ubiquitin molecule and different lysine residues act differently in this reaction. Therefore, it is pertinent to understand and link the role of various lysine residues along with their effector molecules, for instance, E3 ligases PARK2 and STUB1 in the ubiquitination cascade. Herein, we 1) modeled the structure of AβPP and determined its topologies and studied the impact of lysine residues in AβPP stability, 2) reported K351 as the most promising target for AβPP ubiquitination, 3) investigated the plausible role of lysine residues in non-covalent interactions mediated ubiquitin positioning in the ubiquitination, 4) detected conserved amino acids that is crucial for AβPP ubiquitination, and 5) identified the key ubiquitination enzymes and their interaction network playing major role in the ubiquitination of AβPP.
INTRODUCTION
Alzheimer’s disease (AD) is characterized by the aberrant protein accumulation and aggregates such as amyloid-β (Aβ), leading to the development of senile plaques in the brain [1]. In this regard, the amyloid cascade hypothesis affirms the Aβ deposition as an early pathogenic event in the progression of AD [2]. The intra-and extracellular Aβ40/42 deposits perturb the synaptic transmission between neurons and trigger the memory and cognitive decline in AD patients [3]. Here, Aβ and its precursor, AβPP, are the central players in the pathology of AD whose levels are regulated by the protein quality control: ubiquitin proteasome system (UPS). There are other mechanisms that regulate Aβ clearance including enzymatic pathways utilizing insulin degrading enzyme and neprilysin [4], immune-mediated trans-cytosis across the blood-brain barrier [5], and the lysosomal pathway through autophagy [6]. However, the ubiquitination pathway is central to regulate the level of proteins involved in all these pathways; therefore, it has a great importance in the cellular biology for the clearance of Aβ. The ubiquitination process involves the selection of key lysine residues of target protein by E3 ligases for ubiquitin attachment. This ubiquitin ligation at key lysine and their type of poly-ubiquitin chain determines the cellular fate of the target protein [7]. For instance, studies have identified K6 poly-Ub-chain to trigger DNA repair responses and K33 poly-Ub-chain to initiate stress responses. Moreover, K63 poly-Ub-chain to govern DNA repair, endocytosis and inflammatory responses while K11, K27, K29, and K48 poly-Ub-chain to elicit proteasomal degradation [8, 9].
In this manner, UPS greatly influences Aβ production through AβPP ubiquitination and through proteasomal degradation of their regulatory enzymes (β- and γ-secretases). Moreover, there is increasing evidence of non-functional UPS, i.e., ubiquitinating and deubiquitinating enzymes and the proteasomal subunits that are responsible for the altered Aβ clearance in AD patients [10]. However, there are numerous reports depicting the ubiquitination of Aβ or AβPP in vitro, ex vivo, and in vivo but their exact site for ubiquitination remained unknown for most cases (Table 1). Moreover, the AβPP ubiquitination research is at the preliminary stage, where much needs to be investigated to unravel the mystery of amyloid burden mitigation. Therefore, identification of the key lysine residues and the ubiquitination components are crucial for determining the mechanism behind Aβ clearance. In this regard, we have identified the key lysine residues having great potential for AβPP ubiquitination. Moreover, the importance of these key lysine residues in AβPP processing, non-covalent interactions with ubiquitin and other functions have also studied. Further, we modeled the 3D structure of holo-AβPP to investigate the impact of key lysine residues on AβPP stability and their mutational disease susceptibility. Nevertheless, we have also identified the potential ubiquitination enzymes, E1s, E2s, E3s, and DUBs, and their complex interplay in the ubiquitination process of AβPP. In summary, this work demonstrates the AβPP ubiquitination mechanism and Aβ clearance to provide novel therapeutic targets against AD.
Experimental studies reporting the ubiquitination of AβPP and Aβ to rescue the amyloid burden in AD
MATERIALS AND METHODS
AβPP structure modeling
The AβPP structure was modeled with a protein homology/analogy recognition tool Phyre2. It included the assembly of homologous sequences by multiple sequence alignments of query sequence with their sequence homologs and their consequent secondary structure prediction by PSI-pred pooled query hidden Markov model. Further, fold library was scanned for crude backbone construction based on top scoring alignments. Further, loop modeling for correcting insertions and deletions in the models followed by side chain placement to backbone to obtain a final protein structure model [28]. Here, six templates (c3ktmB, c3dxeB, c2yszA, d1rw6a, c1amlA, c2lp1A) were employed to model the AβPP protein structure based on heuristics method to maximize the confidence, percentage identity, and alignment coverage to the queried sequence.
AβPP domain analysis
The protein domain analysis was done to identify the functional sites in the AβPP with MOTIF Search tool. It extracted domain results on the basis of matched bit score for AβPP sequence and each domain found in Pfam library with an E-value cut-off of 0.001 [29].
Structure refinement simulations
The predicted 3D-structure of AβPP was refined with an atomic-level, high-resolution protein structure refinement tool-ModRefiner [30]. We performed the conformational search for the backbone and side chain atoms to simulate the predicted model to their native state in terms of backbone topology, side chain positioning and hydrogen bonds directed by combined physics-and knowledge-based force field energy calculations.
AβPP structural validations through PROVE, ERRAT, VERIFY 3D, and RAMPAGE
The reliability of 3D-atomic models of predicted AβPP validated through the stereo chemical quality measurements of modeled proteins with different structural validation programs.
Identification of potential ubiquitination sites in AβPP
The important lysine residues for AβPP ubiquitination was identified with four different approaches, including 1) the sequence identity between AβPP and ubiquitin lysine sites, 2) sequence similarity between AβPP and ubiquitin lysine sites, 3) ubiquitin-ubiquitin and ubiquitin-AβPP lysine peptides docking, and 4) machine learning based ubiquitination site predictions.
Sequence identity analysis
Pairwise sequence alignment was performed using BioEdit Sequence Alignment Editor Software [35] using BLOSUM 62 substitution matrix. Further, it calculated the identity scores among the 21 window-size central lysine’s neighboring residues of ubiquitin and AβPP sequences.
Sequence similarity analysis
Conserved sequence analysis of amino acid residues neighboring lysine sites was performed by aligning the 21 window-size multiple lysine site sequences in AβPP by the BioEdit Sequence Alignment Editor Software [35] at different thresholds to identify the important residues common to both ubiquitin and AβPP critical for the ubiquitination.
Ubiquitin-AβPP lysine peptides docking
The protein-peptide interactions of ubiquitin protein and AβPP lysine peptides were modeled by flexible docking approach using CABS-dock web server. It executed simulation searching for the binding sites in the receptor protein allowing full flexibility to the peptides being docked [36]. For ubiquitin-AβPP peptide docking, 21 window-size peptide sequences were prepared by taking 10 amino acid residues on either side of the lysine sites from AβPP protein as test set and from ubiquitin protein as control set, respectively. Further, ubiquitin protein (PDB ID: 1UBQ) is flexibly docked with both the test set peptides, and the control set peptides to identify the potential lysine for ubiquitination.
Machine learning techniques: UbiSite, BDM-PUB, hCKSAAP_UBSITE, UbPred, UbiPred
The potential ubiquitination sites for AβPP was predicted by diverse machine learning tools, including UbiSite, BDM-PUB, CKSAAP, UbPred, and UbiPred on the basis of the experimentally verified ubiquitination site datasets.
Functional studies of lysine’s impact on AβPP ubiquitination
The refined AβPP structure was incorporated with both positive (Arginine (R), Histidine (H)) and negative (Aspartic acid (D), Glutamic acid (E)) charged mutations at their lysine sites by Pymol software [42]. The above mutations were studied for their impact on AβPP stability, ubiquitination, modifications, and disease susceptibility.
Investigation of AβPP stability upon lysine mutations
The refined AβPP model and mutated AβPP (Lysine (K)⟶Arginine(R)/Histidine(H)/Aspartic acid(D)/Glutamic acid(E)) analyzed for their total force field energies by Swiss PDB viewer 4.0.2 software [43]. The variations in the total force field energies were estimated to observe the impact of lysine mutations on the amyloid beta precursor protein’s stability.
Physico-chemical property analysis for AβPP ubiquitination
The physico-chemical properties of Aβ, AβPP, and ubiquitin proteins were computationally determined from their peptide sequences by ExPASy server tool ProtParam [44]. It computed various physico-chemical properties, including the atomic and amino acid compositions along with the count of positively and negatively charged residues, the instability index, aliphatic index, and the grand average of hydropathicity of the desired proteins. The physico-chemical properties of AβPP was compared with ubiquitin to investigate the crucial parameters important for the ubiquitination.
Examination of lysine sites crucial for AβPP modifications
The molecular mechanisms associated with the pathogenic amino acid substitutions in AβPP were identified by a machine-learning tool MutPred [45]. It quantified the pathogenicity of mutations based on the probabilistic modeling of a large repertoire of structural-functional alterations, including the disruptions in structure, stability, macromolecular binding, and post-translational modification sites in amino acid sequences.
Mutational analysis of lysine residues for disease susceptibility
The functional impacts of the introduced mutations were studied with different mutation analysis tools, including PANTHER [46], SNAP2 [47], Polyphen2 [48], Pmut [49], PhD-SNP [50], and SIFT [51]. The obtained results were transformed into numerical values to analyze them on the stacked bar graph. For instance, results with “Probably Benign”, “Neutral”, and “Tolerated” were assigned with ‘0’ numerical value and “Possibly Damaging” with ‘1’ numerical value. Similarly, results with “Probably Damaging”, “Effect”, and Not tolerated” were assigned with “2” numerical value and the threshold was taken ‘6’, i.e., more than half prediction tools to predict high confidence lysine having disease susceptibility.
Characterization of ubiquitination pattern of AβPP
The ubiquitination pattern of AβPP was investigated through the combination of multiple studies, including 1) the AβPP-ubiquitin interaction studies, and 2) the prediction of ubiquitination enzyme’s interactional network for AβPP or Aβ clearances in AD biology.
AβPP-ubiquitin interaction prediction and interface residue analysis
The interactions among the different domains of AβPP and ubiquitin proteins were predicted by the protein-protein interaction prediction server-PSOPIA [52]. It compared the sequence similarities to a known interacting protein pair, statistical propensities of the domain-domain interactions, and the sum of edge weights along the shortest path between the homologous proteins in a PPI network. Moreover, the potential ubiquitin interacting domains of AβPP (PDB IDs: 4JFN_A, 2FKL_A, 1AAP_A, 3UMK_A, 1IYT_A, 2LP1_A) were docked with ubiquitin (PDB ID: 1UBQ) by rigid body docking approach using Fast Fourier Transformation methodology of GRAMM-X software [53]. Furthermore, their docked interface residues having distance <4.5A were identified by Pymol software [42] to identify the key lysine residues important for ubiquitin and AβPP interactions.
Prediction of ubiquitination enzymes interactional network for AβPP
The potential ubiquitination enzymes regulating the AD biology of Aβ and AβPP proteasomal clearance were identified by determining the interaction among all the ubiquitin E1-activating enzymes, E2-conjugating enzymes, E3-ligating enzymes, and deubiquitinating enzymes with amyloid forming proteins: AβPP, β-secretases and γ-secretases. Further, the protein-protein interactional network among the identified proteins was designed by functional protein association network prediction STRING tool [54]. Lastly, the overall methodology along with the tools and techniques employed in this study has shown in Fig. 1.

Flow chart depicting the tools and techniques employed for the study of AβPP ubiquitination.
RESULTS
3D-model of AβPP
The three-dimensional structure of AβPP is predicted to analyze the potential lysine residues important for their ubiquitination, since its full experimental structure was lacking. The structural topology of the modeled AβPP has identified an N-terminal signal peptide MLPGLALLLLAAWTARALEVPT (1–22), an extracellular region (1–698), a trans-membrane helix S1 (699–721), and a cytoplasmic region (722–770). Further, domain analysis of full length AβPP (770aa) revealed six functional domains, namely: 1) N-terminal domain, 2) Copper binding domain, 3) Protease inhibitor domain, 4) E2 domain, 5) Aβ peptide domain, and 6) C-terminal domain. The combined results suggest that the first four domains parses in the extracellular region and the Aβ peptide in the trans-membrane region while the C-terminal domain in the cytoplasmic region. Moreover, the modeled structure was spanned by 22 alpha-helices (37%), 7 beta-strands (4%), and one trans-membrane alpha-helix (3%). In addition, the tertiary structure of the AβPP model is enriched with two disulfide linkages one at cysteine144-cystein174 and another at cysteine158-cysteine186 that are very important for their stability. Further, the structural validation of the refined model by PROVE analysis passed their structural quality test with less than 1% buried outlier atoms. Similarly, the sequence information based tertiary structure, i.e., 3D–1D score ≥0.2 was found to be more than 50% for our refined model as per the verify-3D tool. Moreover, the geometrical conformation of the refined model analyzed by RAMPAGE server was found to be greater than 99.5% for the favored and allowed region residues except four amino acids, which included Alanine 35, Aspartic acid 360, Proline 365, and Arginine 653. The refined model of AβPP along with their molecular descriptors and structural refinement simulations is shown in Fig. 2. Further, the structure validation scores of modeled and refined models obtained by verify 3D and RAMPAGE can be accessed from Supplementary Table 1.

AβPP Modeling and Structural Refinement. A) AβPP Model: The modeled AβPP (shown in cartoon and surface view) represented an N-terminal signal peptide (Cyan in color), Trans-membrane helix/domain (Hot pink in color), and a C-terminal region (Purple in color) along with the disulfide bonds (Green/Yellow in color). The domain analysis identified six domains including N-terminal, copper binding, protease inhibitor, E2, Aβ, and C-terminal domain spanned across the full length 770aa protein. The structural topology of the predicted model has identified a membrane spanning trans-membrane helix S1 at 699 to 721 amino acid residues with an extracellular N-terminal (1–699) carrying signal peptide (1–22) and a cytoplasmic C-terminal (721–770). The structural analysis of the predicted model revealed 22 alpha-helices, 7 Beta-sheets, and 1 Trans-membrane helix and 2 disulfide linkages (Cys144–Cys174 and Cys158–Cys186). Moreover, the solvent accessible surface area (30775.238 angstrom∧2) was less than the molecular surface area (80013.477 angstrom∧2) of the model due to inaccessibility of the solvent in trans-membrane region. B) Structural Refinement of AβPP: The modeled AβPP was refined with structural refinement simulations up to 72 cycles (designated P1,P2, … P72) and validated for the refinements by different structural validation programs-PROVE, ERRAT, Verify-3D, and RAMPAGE. C) AβPP Ramachandran Plot: Evaluated 99.5% residues in geometrical (Φ–ψ) favored (719 residues (93.6%)) + allowed residues (45 (5.9%)) regions while only four residues Ala35, Asp360, Pro365, and Arg653 ∼0.5% in outlier region; other structural valition programs including PROVE and ERRAT identified to PASS the modeled structure for their overall quality. D) Verify-3D: The 3D atomic model compatibility assessment revealed the refinement of modeled AβPP from zero cycle (P)– 33.12% to 72 cycle (P72)– 43.9% residues, with 50.13% residues of the best obtained model P30 at 30th cycle showing 2D–3D structural compatibility. E) RAMPAGE: The geometrical validation of C-alpha neighboring residues revealed the refinement from zero cycle (P)– 655 (favored + allowed residues) and 113 (outlier residues) to 72 cycle (P72)– 764 (favored + allowed residues) and 4 (outlier residues). The best geometrical configuration with minimum simulations was achieved at 30th cycle (P30) signifying it as a best-predicted model.

Potential lysine prediction for ubiquitination in AβPP based on: A) Sequence Identity: The sequence identity scores among 21 window size polypeptides carrying central lysine (K6-Blue, K11-Pink, K27-Brown, K29-Green, K33-Yellow, K48-Red, K63-Purple) in ubiquitin and (K51, K60, K66, K99, K103, K106, K132, K134, K155, K161, K178, K224, K315, K351, K363, K377, K393, K395, K401, K421, K425, K428, K429, K438, K495, K496, K503, K510, K521, K522, K568, K601, K662, K670, K687, K699, K724, K725, K726, K751, K763) in AβPP are plotted on a bar graph. The identity scores greater than 0.2 represented the conservation of more than two amino acid residues in the aligned sequences signifying the presence of potential lysine for ubiquitination based on sequence identity (shown in Red)- K60, K66, K99, K106, K132, K134, K155, K161, K351, K363, K377, K393, K395, K401, K503, K510, K522, K568, K601, K662, K670, K687, K699, K724, K725, K726. B) Sequence Similarity: The lysine neighboring amino acid conservation analysis among 21 window size polypeptides carrying central lysine have identified the conservation of Asp(D), Glu(E), Ile(I), Leu(L), and Val(V) at 57% similarity threshold and Asp(D), Gln(Q), Glu(E), Ile(I), Leu(L), Met(M), Thr(T), and Val(V) at 42% similarity threshold in ubiquitin while the conservation of Gln(Q) and Glu(E) at 34% similarity threshold; Gln(Q), Glu(E), Ile(I), Leu(L), Met(M), and Val(V) at 31% similarity threshold and Asn(N), Gln(Q), Glu(E), Ile(I), Leu(L), Met(M), and Val(V) at 29% similarity threshold in AβPP. The amino acid residues conserved at the common sites of both ubiquitin and AβPP are enclosed by red rounded rectangles signifying the critical amino acids for ubiquitination including Glu (E), Gln (Q), Val (V), Leu (L), Met (M), and Ile (I). Further, on the basis of best conserved amino acids, i.e., glutamic acid and glutamine common to both ubiquitin and AβPP (marked by yellow arrow), the potential lysine for ubiquitination are identified and encircled in pink rectangles including K224, K351, K377, K393, K401, K503, K510, K522, K662, K724, K751, and K763.
K351 is the most promising target for AβPP ubiquitination
The potential lysine sites which are favorable for the ubiquitination of AβPP are identified by four different methodologies, including sequence identity, sequence similarity, protein-peptide docking, and machine learning techniques. Since, ubiquitin is the pre-eminent protein that is ubiquitinated the most in any cellular processes due to the poly-ubiquitination phenomenon; therefore, its lysine site features are of great importance to unravel the ubiquitination mystery. These informative sites of ubiquitin are utilized by above mentioned methodologies to deduce the potential lysine sites in AβPP.
Sequence identity based potential lysine in AβPP
The sequence identity between the 21 window-size peptide sequence with central lysine at seven lysine sites of ubiquitin and forty-one lysine sites of AβPP is determined to infer the promising ubiquitination sites in AβPP. The sequence-identity of at least three amino acids, i.e., identity score >0.2 was taken as the threshold, which identified 26 key lysine sites in AβPP including K60, K66, K99, K106, K132, K134, K155, K161, K351, K363, K377, K393, K395, K401, K503, K510, K522, K568, K601, K662, K670, K687, K699, K724, K725, and K726 as favorable sites for the ubiquitination (Fig. 3A). Among them these seven K60, K66, K351, K363, K601, K662, and K687 lysine sites displayed the higher identity scores. For detailed sequence identity scores obtained for each pairwise sequence alignment, Supplementary Table 2 can be explored.
Sequence similarity based potential lysine in AβPP
The sequence conservation analysis among lysine neighboring residues in ubiquitin revealed the conservation of hydrophobic leucine (L), isoleucine (I), valine (V) and polar negatively charged aspartic acid (D), glutamic acid (E) at 57% similarity threshold. Further, the conservation in ubiquitin at 42% similarity threshold provided the majority of informative amino acid residues in common with AβPP that are crucial for the ubiquitination process. These conserved residues included the hydrophobic methionine (M), leucine (L), isoleucine (I), and valine (V), polar negatively charged glutamic acid (E), and polar uncharged glutamine (Q). Moreover, in AβPP only glutamic acid (E) and glutamine (Q) is conserved at the higher threshold of 34% sequence similarity, which depicted K224, K351, K377, K393, K401, K503, K510, K522, K662, K724, K751, and K763 as potential lysine sites (Fig. 3B).
Ubiquitin-AβPP lysine site peptide docking based potential lysine in AβPP
The affinity of lysine specific ubiquitin-ubiquitin interaction in the poly-ubiquitination process is taken as the principal to investigate the potential lysine sites in AβPP important for ubiquitination. The flexible docking of ubiquitin with the best conformations of ubiquitin’s seven lysine peptides, K6, K11, K27, K29, K33, K48, and K63, resulted in the average root mean square deviation (Avg. RMSD) of 2.92, 5.31, 4.92, 3.88, 4.49, 1.96, and 4.63 respectively. Further, the obtained docking results of ubiquitin with ubiquitin and AβPP lysine peptides are summarized in Table 2. Interestingly, the Avg. RMSD of K48 was minimal, i.e., 1.96 showing the best affinity for K48 linked poly-ubiquitination than the K11 site, which had highest Avg. RMSD of 5.31 implying the least propensity of K11 linked poly-ubiquitination. Here, the minimal Avg. RMSD, i.e., best binding affinity than the K11 (5.31) was taken as a threshold to find the best propensity ubiquitination sites in AβPP. The 21 potential sites were identified with good propensities for ubiquitination, including K51, K60, K99, K132, K161, K178, K351, K393, K401, K425, K495, K496, K503, K510, K521, K522, K568, K687, K724, K725, and K726 (Fig. 4A). Among them, the lowest Avg. RMSD of 0.906538 was obtained for the K687 site in AβPP or corresponding K16 site in Aβ showing best potential for ubiquitination.
Ubiquitin and AβPP lysine peptides docking scores with ubiquitin protein using CABS-dock
Machine learning ubiquitination tools based potential lysine in AβPP
The different machine learning approaches, including maximal dependence decomposition, Bayesian discriminant analysis, random forest models, and support vector machine classifier-based tools (UbiSite, BDM-PUB, CKSAAP, UbPred and UbiPred) have been employed to determine the potential lysine sites in AβPP for ubiquitination on the basis of experimentally verified ubiquitination site datasets. The predicted ubiquitination sites and scores obtained by UbiSite, BDM-PUB, CKSAAP, UbPred, and UbiPred tools are summarized in Table 3. Moreover, the stacked bar graph of the ubiquitination prediction scores versus AβPP lysine sites are plotted to identify the potential lysine in AβPP as shown in Fig. 4B. The best ubiquitination aggregate scores were obtained for K351 and K377, i.e., 3.19 and 2.47, respectively, with maximal predictions by four tools: UbiSite, BDM-PUB, UbPred, and UbiPred.
Ubiquitination Prediction Scores of Ubiquitination-site Prediction Tools
Further, these potential lysine residues were classified into different confidence levels: very high, high, medium, low, and very low based on the evidence from ubiquitination prediction tools and their verification by other methods, including sequence identity, sequence similarity, and flexible protein-peptide docking. For instance, the potential ubiquitination sites predicted by at least three ubiquitination prediction tools and is verified by at least any two other methods are classified as “Very High” confidence, while those verified by any one of the other methods are classified as “High” confidence. In a similar way, the potential ubiquitination sites predicted by any two ubiquitination prediction tools and are verified by at least any one of the other methods are categorized into “Medium” confidence. Additionally, the potential ubiquitination sites predicted by any one of the ubiquitination prediction tools, which is verified by at least, any two of the other methods, are assigned as “Low” confidence whereas if verified by any one of the other methods then classified as “Very Low” confidence. The key lysine residues important for the ubiquitination of AβPP with predicted confidence are listed in Table 4. Moreover, the Venn diagrams comprehensively reviewed the predicted ubiquitination sites and illustrated the logical relations among the key ubiquitination sites in AβPP (Fig. 4C). The Venn diagram analysis clearly depicted K351 as the most promising ubiquitination site at high confidence; K393, K401, K510, and K522 at medium confidence; and K503 and K724 at low confidence.
Predicted Confidence of the Key Ubiquitination sites in AβPP

Potential lysine prediction in AβPP and their comparative analysis: A) Ubiquitin and K-AβPP site Peptide Docking: The average root mean square deviation (Avg. RMSD) graph of the docked 21 window size K-site AβPP peptide with ubiquitin is dot plotted and is compared with the Avg. RMSD of the docked 21 window size K-site Ubiquitin peptide with the ubiquitin protein. The Avg. RMSD of K-11 ubiquitin, i.e., 5.31, is taken as threshold to identify potential ubiquitination site in AβPP (K51, K60, K99, K132, K161, K178, K351, K393, K401, K425, K495, K496, K503, K510, K521, K522, K568, K687, K724, K725, and K726) with lower Avg. RMSD values. B) Ubiquitination Prediction Tools: The machine learning based ubiquitination prediction tools-UbiPred (Light Blue), UbPred (Purple), CKSAAP (Green), BDM-PUB (Dark Red), and UbiSite (Blue) has predicted potential ubiquitination sites in AβPP including K51, K60, K103, K134, K155, K161, K224, K351, K363, K377, K393, K395, K401, K421, K425, K428, K429, K438, K495, K496, K503, K510, K521, K522, K568, K601, K662, K670, K699, K724, K751, and K763. C) Comparative Analysis of Ubiquitination Sites: The comparison of the potential ubiquitination sites predicted by all the methods have revealed the most potential ubiquitination sites which is predicted by at least three ubiquitination prediction tools and verified by at least any two other methods are K351 and K377 (marked Red) with very high confidence, while verified by any one of the other methods are K224, K363, K601, K751, and K763 (marked Red) with high confidence. Similarly, the potential ubiquitination sites predicted by any two ubiquitination prediction tools and verified by at least any one of the other methods are K60, K161, K393, K401, K496, K510, K522, and K662 (marked Red) with medium confidence. Likewise, the potential ubiquitination sites predicted by any one of the ubiquitination prediction tool and is verified by at least any two of the other methods are K503, K568, and K724 (marked Red) with low confidence while verified by any one of the other methods are K51, K134, K155, K395, K425, K495, K521, K670, and K699 (marked Red) with very low confidence.
Lysine residues are crucial for AβPP stability, ubiquitination, and other functions
The mutational studies on AβPP revealed the importance of lysine residues in AβPP processing and ubiquitination. When we mutated the key lysine (Polar; positively charged; basic amino acid; pK = 10.5) with other polar; positively charged; basic amino acids-Arginine (pK = 12.5), Histidine (pK = 6.0) and polar; negatively charged; acidic amino acids-Aspartate (pK = 3.9) and Glutamate (pK = 4.2), we observed some interesting effects on the internal potential energy of mutated AβPP summarized in Table 5. We found that only arginine had imparted site-specific stability to even more than half of the predicted ubiquitination sites, including K51, K60, K134, K161, K224, K393, K401, K425, K496, K510, K521, K522, K699, and K724, and instability at the rest site of AβPP (Fig. 5A). Moreover, the lysine residues at high confidence ubiquitination sites were found intolerant towards all mutations against the stability of AβPP. Apart from arginine, histidine was observed to greatly impact the stability of AβPP followed by glutamate and aspartate. However, the effect of glutamate on AβPP stability was less than the arginine and histidine, but it affected nearly all the ubiquitination sites (Fig. 5A). Further, the detailed internal potential energies obtained for the modeled and mutated AβPP, and the mutational impact of lysine mutations on their total potential energies can be inferred from Supplementary Tables 3 and 4, respectively.
Effect of Lysine Mutation on Total Potential Energy of AβPP
Further, the disease susceptibility of lysine mutations were checked through the mutational analysis tools, PANTHER, SNAP2, Polyphen2, PMut, Phd-SNP, and SIFT. It revealed that mutations had an effect on all the sites, but they had very least effect on the high confidence ubiquitination sites in comparison with the medium and low confidence sites. The highly intolerant mutations that were most susceptible to the diseases are shown in Fig. 5B. In addition, the detailed results for the mutation associated disease susceptibility predicted by PANTHER, SNAP2, Polyphen2, PMut, PhD-SNP, and SIFT can be accessed through Supplementary Table 5. Further, we analyzed the physico-chemical and amino acid compositional analysis of ubiquitin, Aβ42 peptide, and AβPP to understand the crucial factors that determine the ubiquitination of a protein. Interestingly we found that total number of positively and negatively charged residues were in same proportion in ubiquitin along with the aliphatic index-100 and GRAVY-– 0.489 which could be the decisive factor for ubiquitin to be the top ubiquitination protein. Moreover, some amino acid compositions, including arginine, asparagine, aspartate, glutamate, phenyl-alanine, proline, and serine were found in the comparable range±1% in ubiquitin, Aβ42 peptide, and AβPP (Fig. 5C). Further, lysine sites in AβPP were investigated for their role in the other cellular processes and functions by MutPred, which has been outlined in Table 6. The collective results depicted that apart from ubiquitination; lysine residues are also crucial for protein stability, modifications, and other functions.
Importance of Lysine residues in AβPP Ubiquitination, Processing and other Functions
Can lysine mediated non-covalent interactions prevail ubiquitin positioning and ubiquitin-AβPP conjugation during ubiquitination?
The non-covalent interactions of ubiquitin with different proteins, such as human DNA repair proteins, insulin-degrading enzyme, and SUMO protein, are well known for modulating their activity or correctly orienting ubiquitin for lysine specific conjugation [55 –57]. These interactions can also impart stability to the ubiquitin-protein conjugates in the ubiquitination process [58]. In this regard, we predicted the interaction among all the domains of AβPP and ubiquitin and examined them for the lysine mediated polar interactions among their interacting residues. The ubiquitin-AβPP interaction predictions reported that ubiquitin has nearly same interactive affinity toward all the domains of AβPP except Aβ peptide. The prediction of ubiquitin-AβPP interactions was highest on the basis of homologous protein interactional network (SNet), rather than by the statistical domain-domain interactions (SDom) and sequence similarity based interacting protein pairs (SSeq) shown in Fig. 6A. Further, lysine investigation in non-covalent electrostatic, polar interactions revealed the presence of lysine interactions with N-terminal domain (K66, K99), Copper binding domain (K155), E2 domain (K447), Aβ peptide (K28), and C-terminal domain of AβPP (K687). Apart from AβPP, ubiquitin’s lysine residues, including K6, K11, K33, K48, and K63 were also involved in the polar interactions (Fig. 6B). The presence of lysine as interacting residues provided a clue for their role in ubiquitin positioning or ubiquitin-AβPP conjugation but further researches are required for clear understanding. These informative residues are the foundations for future avenues of AβPP ubiquitination mechanistic research.
Interaction network of ubiquitination proteins for AβPP clearance

Mutational Analysis of Lysine residues for AβPP Ubiquitination. A) Impact of Lysine Mutation on AβPP Stability: The total energy change in AβPP upon Lysine mutations have identified the prominent effect of Lysine-Arginine and Lysine-Histidine mutations as the most detrimental to AβPP stability in comparison with Glutamic and Aspartic acid. Here, nearly 14 Lysine-Arginine mutations including K51, K60, K134, K161, K224, K393, K401, K425, K496, K510, K521, K522, K699, and K724 are reported to increase the stability of AβPP by reducing their total energy in KJ/mol. B) Lysine Mutation and Disease Susceptibility: The investigation by various mutation analysis tools including PANTHER, SNAP2, Polyphen2, Pmut, PhD-SNP, and SIFT have identified the key lysine sites where mutations are susceptible to disease pathogenesis and are marked with red color taking the threshold of predictions by more than three tools for best accuracy. The highly disease susceptible mutations include K51H/D/E, K60H/D/E, K134H/D/E, K155H/D/E, K161D, K393H/D/E, K395R/H/D/E, K401H/D/E, K425H/D/E, K495H/D/E, K496H/D/E, K503R/H/D/E, K510D/E, K521R/H/D, K522R/H/D/E, K662H, K670D, K699H/D/E, K724H/D/E, K751H/D, and K763H/D/E. C) Physico-chemical Properties and Amino acid Composition: The physico-chemical properties of Ubiquitin showed equal no of positive and negative charged residues, with low hydrophobicity and high stability, while Aβ42 showed high hydrophobicity and stability in comparison to AβPP low hydrophobicity and instability. Amino acid compositional analysis revealed the comparable percentage within 1% range of Ubiquitin with Aβ42 and AβPP including Arginine, Asparagine, Aspartic acid, Glutamic acid, Phenyl alanine, Proline, and Serine.
The accumulation of well-known pathogenic Aβ can be regulated by the clearance of its precursor, AβPP, through ubiquitination. However, ubiquitination is a well-defined process, but the association of different ubiquitination E1s, E2s, E3s, and deubiquitinating enzymes (DUBs) always remained an interesting field of research for target specific protein clearance. Here, we investigated the interaction network of E1s, E2s, E3s, and DUBs enzymes for the clearance of AβPP in AD biology. The interaction network identified ubiquitin E3 ligases, Park2 and STUB1, to be associated with AβPP ubiquitination while deubiquitination was by USP25 and UCHL1 (Fig. 7A). The ubiquitination processing of other enzymes that govern the synthesis of Aβ peptide are also crucial to restrict them. Here, certain E3 ligases like FBXO2 is reported to regulate the ubiquitination of BACE1; TRIM13 to regulate PSENEN; CDH1 to regulate PSEN1 and NCSTN; while TRIM55 and SART1 to regulate the ubiquitination of NCSTN. Likewise, specific deubiquitinase enzymes, including USP25 and UCHL, are found to regulate AβPP; USP8 to BACE1; and USP39 to NCSTN, respectively (Fig. 7A). Further, examinations on the potential E1 activating and E2 conjugating enzymes were done to identify the pre-processors of AβPP ubiquitin ligation by Park2 and STUB1 E3 ligases. We reported an array of ubiquitin E1 activating enzymes UBA1, UBA6, UBA7, and ATG7 that can possibly activate specific ubiquitin E2 conjugating enzymes including UBE2A, UBE2B, UBE2G1, UBE2J1, UBE2K, UBE2L3, UBE2L6, UBE2N, UBE2Q1, UBE2S, UBE2U, and UBE2Z to conjugate lysine to Park2 (Fig. 7B); while additional UBE2I, UBE2T, and UBE2W enzymes to conjugate lysine to STUB1 (Fig. 7C). The PPI network provided us the substantial information about the ubiquitination enzymes associated with the clearance of AβPP in the AD biology that can be further explored for the therapeutic avenues.
DISCUSSION
The structural topology of the modeled AβPP has identified 1) an N-terminal signal peptide of 22AA residues (MLPGLALLLLAAWTARALEVPT) and a trans-membrane segment S1 (699–721). Here, positively charged arginine (Arg16) and the N-terminal hydrophobic amino acids are crucial for AβPP’s post-translational translocation across the ER membrane. Since, the removal of positively charged amino acid from signal peptide selectively impairs the translocation, while the presence of hydrophobic residues favors it through lipid bilayer of ER membrane toward the secretory pathway [59]. 2) The domain analysis revealed six domains that are contributing toward the functionality of AβPP. Among them four domains, including N-terminal, copper binding, protease inhibitor (Kunitz_BPTI), and E2 domain spans over the extracellular surface region and are responsible for copper binding, protease inhibition, platelet aggregation, and hemostasis. While the proteolytic product of cytoplasmic c-terminal domain acts as a transcriptional regulator in neurons. In addition, the cleavage of their trans-membrane domain is responsible for Aβ peptide formation [60]. 3) The secondary structure analysis identified a higher alpha-helical content ∼37% in AβPP indicating their potent ability to tolerate mutations, since helices can accrue more mutations than beta strands without structural distortion due to their higher numbers of inter-residue contacts [61]. 4) Their tertiary structure revealed two disulfide linkages (cysteine144-cystein174 and cysteine158-cysteine186) that imparted stability to the AβPP. Further, the potential ubiquitination sites in AβPP has identified by adopting the sequence identity, sequence similarity and protein-peptide docking approach using lysine site features of ubiquitin, for being the eminent protein for poly-ubiquitination [62, 63]. The comparative analysis of thus predicted ubiquitination sites with the potential sites identified by the machine learning approaches revealed K351 as most promising ubiquitination site at high confidence; K393, K401, K510, and K522 at medium confidence; and K503 and K724 at low confidence. Additionally, the site specific conserved amino acids, glutamic acid (E), glutamine (Q), valine (V), leucine (L), isoleucine (I), and methionine (M), were identified neighboring the lysine residue in AβPP. These conserved polar residues Glu(E) and Gln(Q) at ubiquitination site would display their propensities for salt-bridges while conservation of hydrophobic residues Val(V), Leu(L), Ile(I), and Met(M) indicate their propensities for buried contacts signifying their role in the ubiquitination reaction [64]. Interestingly, we can comprehend the presence of polar negatively charged Glu(E) amino acids neighboring lysine for their electrostatic interactions among each other in their native folded state [65] while ubiquitination directs lysine’s covalent attachment with c-terminal glycine of ubiquitin molecule that opened the avenues for further validations.
GSK3, Glycogen synthase kinase-3; CK1, Casein kinase-1; CK2, Casein kinase-2; PIKK, Phosphatidylinositol 3-kinase-related kinase; FHA, Forkhead-associated domain; SUMO, Small Ubiquitin-like Modifier; NEK2, NIMA Related Kinase 2; PKB, Protein Kinase B; PKC, Protein Kinase C; LATS, Large tumor suppressor kinase 1; TRAF2, TNF receptor-associated factor 2.

Lysine residues in Ubiquitin and AβPP Interactions. A) Prediction of AβPP– Ubiquitin Interactions: The strongest evidence for AβPP–Ubiquitin interactions was reported by SNet, i.e., sum of edge weights along the shortest path between homologous proteins in a protein-protein interaction network followed by overall interactions (SAll), statistical propensities of domain-domain interactions (SDom), and sequence similarities to a known interacting protein pair (SSeq). B) Important Lysine for AβPP– Ubiquitin Interactions: The AβPP-Ubiquitin interactions revealed the role of diverse lysine residues in the interaction of different AβPP domains and Ubiquitin including K6, K11, K33, K48, and K63 in ubiquitin and K66, K99, K155, K447, K687 (K16 in Aβ), and K699 (K28 in Aβ) in AβPP.

Protein-Protein Interaction Network. A) Ubiquitination Enzyme Network for AβPP: The interaction network of AβPP and amyloid cascade proteins (BACE1, BACE2, PSEN1, PSEN2, PSENEN, NCSTN, APH1A) with the ubiquitin E3 ligases are shown in molecular action view. Ubiquitin E3 ligases, Park2 and STUB1 along with deubiquitinases, USP25 and UCHL1 interact with AβPP to regulate its ubiquitination process. Other E3 s including FBXO2, CDH1, TRIM13, TRIM55, and SART1 along with deubiquitinases USP8, USP25, USP39, and UCHL1 regulate the ubiquitination of amyloid cascade enzymes β-secretase (BACE1) and γ-subunit complex (PSEN1, PSNEN, NCSTN). The network’s whole genome statistical analysis identified the interacting proteins associated with the proteolysis process, endopeptidase activity, intracellular membrane-bound organelle cellular component and AD pathway. B) E1s and E2s Interaction with PARK2: The network identified different ubiquitin E2 conjugating enzymes, UBE2A, UBE2B, UBE2G1, UBE2J1, UBE2K, UBE2L3, UBE2L6, UBE2N, UBE2Q1, UBE2S, UBE2U, and UBE2Z; and ubiquitin E1 activating enzymes, UBA1, UBA6, UBA7, and ATG7 associated with the conjugation of lysine to Park2. C) E1s and E2s Interaction with STUB1: The network identified different ubiquitin E2 conjugating enzymes-UBE2A, UBE2B, UBE2G1, UBE2I, UBE2J1, UBE2K, UBE2L3, UBE2L6, UBE2N, UBE2Q1, UBE2S, UBE2T, UBE2U, UBE2W, UBE2Z; and ubiquitin E1 activating enzymes-UBA1, UBA6, UBA7, ATG7 associated with the conjugation of lysine to STUB1. STUB1-STIP1 homology and U-box containing protein 1; PSENEN-Presenilin enhancer 2 homolog; CDH1-Cadherin 1; UCHL1-Ubiquitin carboxyl-terminal esterase L1; APP-Amyloid beta (A4) precursor protein; USP25-Ubiquitin specific peptidase 25; NCSTN-Nicastrin; TRIM13-Tripartite motif containing 13; USP8-Ubiquitin specific peptidase 8; SART1-Squamous cell carcinoma antigen recognized by T cells; USP39-Ubiquitin specific peptidase 39; BACE1-Beta-site AβPP-cleaving enzyme 1; TRIM55-Tripartite motif containing 55; PSEN1-Presenilin 1; BACE2-Beta-site AβPP-cleaving enzyme 2; FBXO2-F-box protein 2; PSEN2-Presenilin 2; PARK2-Parkinson protein 2; APH1A-Anterior pharynx defective 1 homolog A.
Moreover, the mutational analysis of lysine’s impact on AβPP stability revealed only arginine for imparting site-specific stability to the AβPP while rest mutations were detrimental for the structural conformation of AβPP. The stability associated with arginine mutations can be attributed to its highly basic nature and resonating states than lysine that can stabilize the positive charges in protein. The arginine has also shown protein stability results with other proteins [66] thereby also indicating their role in structural stability of AβPP. Interestingly, lysine residues at high confidence ubiquitination sites were intolerant against all mutations thereby affecting AβPP stability and signifying their importance in protein’s structural and functional regulation. Further, the evaluation of lysine mutations for disease susceptibility indicated their least effect on the high confidence ubiquitination sites than the other sites. This situation can be extrapolated that ubiquitination may be more selective towards highly conserved or stable lysine sites than the least stable sites as observed by Kim and Hahn, who reported the gain of ubiquitination sites in the highly conserved region of proteins [67]. Further analysis by Mutpred identified lysine residues to play a role in AβPP acetylation, glycosylation, phosphorylation, and SUMOylation apart from the ubiquitination. Additionally, investigations on AβPP-ubiquitin non covalent electrostatic, polar interactions reported K66, K99, K155, K447, K687, and K699 lysine residues having their plausible role in ubiquitin positioning or ubiquitin-AβPP conjugation that are the foundations for further research. Last but not the least, we investigated the interaction network of E1s, E2s, E3s, and DUBs enzymes for the clearance of AβPP in AD biology. The network revealed Parkin and STUB1 to be the key ubiquitin E3 ligases and USP25 and UCHL1 to be the key deubiquitinases directly involved in the ubiquitination of AβPP along with more than a dozen of E2 conjugating and E1 activating enzymes. While other E3 ligases such as FBXO2, TRIM13, CDH1, TRIM55, and SART1 are reported to regulate the ubiquitination of BACE1, PSENEN, PSEN1, and NCSTN, respectively. Similarly, other deubiquitinases, like USP8 and USP39, regulate BACE1 and NCSTN deubiquitination, respectively. In summary, the indepth studies pertaining to the lysine potential in AβPP processing, stability, interaction, ubiquitination, and other functions are provided herein that needed further investigations in vivo at the molecular level to devise novel therapeutic modalities against AD.
Conclusions
The Aβ synthesis is the consequence of AβPP processing by the β- and γ-secretases in neurons through amyloidogenic pathway, which is tightly regulated by the ubiquitination process. Moreover, any disruption in the ubiquitination of AβPP and Aβ, selectively amplify Aβ level that subsequently triggers AD pathogenesis. The identification of key lysine governing ubiquitination is not only crucial for regulating Aβ level in neurons but also for determining the mechanism of AβPP’s subcellular trafficking and processing. Further identification of potential E3 ligases for directly (Parkin and STUB1) or indirectly (FBXO2, TRIM13, CDH1, TRIM55, and SART1) restraining Aβ production served as key therapeutic candidates for targeting neurodegenerative pathologies. Moreover, their identification is also crucial for developing chimera products like PROTACs for achieving selective protein degradation in the diseased state, which is an interesting area of research for the scientist [68]. Moreover, the finding of conserved residues near ubiquitination sites will aid in better understanding the mechanism behind lysine selection by E3 ligases and their interactions for ubiquitin positioning to govern lysine specific (K6, K11, K27, K29, K33, K48, K63) polyubiquitination. Additionally, we also reported that lysine residue of target protein may be involved in the ubiquitin positioning for determining the type of poly-ubiquitination chain during ubiquitin attachment. The present research has revealed the dynamics of Aβ or AβPP ubiquitination, which is essential to device, the strategies to regulate the AβPP metabolism. This would enable us to avoid the complications arisen from the elimination of secretases mediated biological functions of AβPP fragments: ectodomain-sAβPPα, sAβPPβ, N-terminal-APP-NTFs, and intracellular-AICD in regulating gene transcriptions [69]. Moreover, AβPP is proposed to be a cell-surface receptor [70]; therefore, AβPP ubiquitination may also serve as a signaling event for some unknown cellular processes apart from merely a signal for degradation. Further investigations are required to develop the prospective therapeutic agents that can address the clearance of such toxic proteins (Aβ) or their progenitors (AβPP) in a regulated way to ameliorate the neurodegenerative diseases like AD globally.
